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METHOD FOR PRODUCING TRANSGENIC BIRDS AND FISH 

Statement of Government Support 
This invention was made with government support under Grant Number GM39458 
awarded by the National Institutes of Health. The United States Government has certain 
rights in the invention. 



Background of the Invention 

10 

Field of the Invention 

The present invention relates to methods for generating transgenic animals, 
particularly transgenic birds and fish, using viral constructs engineered to carry the 
transgene(s) of interest. 

15 

Description of the Related Art 

Early transgenic experiments used an oncoretrovirus to introduce the gene of 

interest into embryonic cells (Jaenisch Proc. Natl. Acad. Sci. USA 73:1260-1264 (1976)). 

In a typical experiment an engineered Moloney strain of mouse leukemia virus (MoMLV) 
20 was injected into the blastocyst cavity of mice. While the transgene was often integrated 

into the genome of the resulting mice, no gene expression could be detected. 

Today, the majority of transgenic animals are made using direct injection 

technology (Gordon and Ruddle Science 214:1244-1246 (1981)). Briefly, a DNA construct 

carrying the gene of interest is injected directly into the pronucleus of a single-cell zygote. 
25 The cell is then implanted into a pseudo-pregnant female and the resulting progeny is 

analyzed for expression of the gene. 

While this method achieves both integration and expression of the transgene, there 

are a number of significant drawbacks to the direct injection technique. First, in order to 

carry out the technique it is necessary to inject DNA directly into the pronucleus. This is 
30 possible in some specific strains of mice, such as Black6 x BDA, because the male 

pronucleus is visible. However, in other animals and other strains of mice the pronucleus is 

less visible, making the technique extremely difficult. Further, the injection requires the 
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assistance of a skilled technician and a significant investment in equipment; 
micromanipulators are necessary to hold the cell and the injection pipette and a pressure 
source is required that can deliver picoliter amounts of DNA solution. 

A second equally significant problem with the direct injection method is the low 
5 percentage of injected zygotes that produce transgenic animals. The injection pipette must 

go through the zona pellucida, the cell membrane and the nuclear envelope. Thus only 80- 
90% of mouse cells survive the injection. Other animal cells are less hardy and the survival 
rates are somewhat lower, with about 60% survival for rats and 40-50% for cows. In mice, 
of the original zygotes, about 25% are successfully injected and implanted in a 

10 pseudopregnant female. About 20% of the resulting animals have the transgene integrated 

into their genome. Of these, about 20% will express the gene. However, even if the 
animals express the gene, it is possible that the expression pattern will not be useful. Thus, 
only about 1% of injected zygotes result in transgenic animals that express the gene of 
interest. This low efficiency of transgenesis is particularly troubling for larger animals, 

15 such as pigs, cows or goats, in which obtaining large numbers of embryos is not possible 

(see, e.g., Wall et al. J. Cell. Biochem. 49:1 13 (1992)). 

In addition, direct pronuclear injection is not possible for many types of animals, 
including birds. 

20 Summary of the Invention 

One aspect of the present invention concerns a method of producing a transgenic 
animal, preferably a transgenic bird or a transgenic fish. In one embodiment a method is 
provided for producing a transgenic bird, the method comprising transfecting a packaging 
cell line with a retroviral construct, recovering recombinant retrovirus, and infecting a 
25 fertilized bird egg with the recombinant retrovirus. 

In one embodiment infecting a bird egg comprises contacting the embryonic 
blastodisc of the bird egg with the retroviral particles. 

The retroviral construct preferably comprises the R and U5 sequences from a 5' 
lenti viral long terminal repeat (LTR) and a self-inactivating lenti viral 3* LTR. Further, the 
30 self-inactivating 3' LTR preferably comprises a U3 element with a deletion of its enhancer 

sequence. In one embodiment the LTR sequences are from HIV. 



-2- 



WO 03/022228 



PCT/US02/29157 



In one embodiment the retroviral construct comprises a gene that is desirably 
expressed in the transgenic bird. In this embodiment the retroviral construct may also 
comprise an internal promoter and/or enhancer. In one embodiment the internal promoter 
is ubiquitous. The ubiquitous promoter may be any ubiquitous promoter known in the art. 
5 For example, the ubiquitous promoter may be selected from the group consisting of the 

ubiquitin promoter, the CMV P-actin promoter and the pgk promoter. In another 
embodiment the internal promoter is tissue-specific. The tissue specific promoter may be 
any tissue specific promoter known in the art, for example, a promoter selected from the 
group consisting of the lck promoter, the myogenin promoter and the thyl promoter. 

10 In addition, the recombinant retrovirus may be pseudotyped. Thus, in one 

embodiment the recombinant retrovirus is pseudotyped with the vesicular stomatitis virus 
envelope glycoprotein. 

The viral construct may comprise one or more additional genetic elements. In one 
embodiment the viral construct comprises a promoter operably linked to the R and U5 5' 

1 5 LTR sequences, preferably a CMV promoter sequence. An enhancer, preferably a CMV 

enhancer sequence, may also be included in the viral construct. 

In another embodiment the viral construct comprises a woodchuck hepatitis virus 
enhancer element sequence. In yet another embodiment the viral construct comprises a 
tRNA amber suppressor sequence. 

20 The viral construct may additionally comprise a reporter gene operably linked to the 

internal promoter. The reporter gene may encode be a fluorescent protein, preferably green 
fluorescent protein. 

In another aspect the invention concerns a method of producing a transgenic bird 
comprising opening a window in the shell of a fertilized bird egg, injecting modified 

25 retrovirus into the space between the perivitelline membrane and the embryonic blastodisc 

and incubating the embryo until hatching. The modified retrovirus is preferably a modified 
lentivirus. The modified lentivirus is preferably produced by transfecting a packaging cell 
line with a viral construct. In one embodiment the viral construct comprises the R and U5 
sequences from a lentiviral 5* LTR, an internal promoter, a gene of interest and a self 

30 inactivating lentiviral 3* LTR. 

In yet another aspect the invention concerns a transgenic bird made by any of the 
disclosed methods. Thus, the transgenic bird preferably comprises one or more cells, 
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preferably germ cells, that comprise proviral DNA. The proviral DNA may comprise a 
self-inactivating lenti viral 3* LTR, such as a self-inactivating HIV 3' LTR. In particular, 
the self-inactivating 3' LTR may have a deletion of its enhancer sequence. 

In another embodiment a transgenic fish is produced by a method that comprises 
5 transfecting a packaging cell line with a viral construct, recovering recombinant retroviral 

particles, and infecting a fish egg with the recombinant retroviral particles. Preferably the 
viral construct comprises the R and U5 sequences from a 5' lentiviral LTR and a self- 
inactivating 3* lentiviral LTR. Infecting the fish egg preferably comprises delivering the 
retroviral particles to the space between the chorion and the cell membrane of the fish egg. 

10 

Brief Description of the Drawings 
Figure 1 A is a diagram of the FUGW viral construct. Figure IB is a diagram of the 
provirus that is integrated into the host genome after infection with recombinant virus 
prepared with the FUGW viral construct of Figure 1 A. 

15 Figure 2 is a Southern blot analysis of proviral transgene insertions in the founder 

generation of mice generated by injecting recombinant lentivirus into the perivitelline space 
of one-cell embryos. Genomic DNA from each animal was digested with either PstI (left) 
or BamHI (right), and probed with a GFP+WRE sequence. All PstI and BamHI sites in the 
provirus are located 5* to the GFP gene. Plus signs above each lane indicate GFP 

20 expression in the animal detectable by viewing under conventional epi fluorescence. 

Figure 3 is a Southern blot analysis of proviral transgene insertions in the founder 
generation of a second group of mice generated by injecting recombinant lentivirus into the 
perivitelline space of one-cell embryos. Genomic DNA from each animal was digested 
with BamHI and probed with a GFP-i-WRE sequence. All BamHI sites in the provirus are 

25 located 5* to the GFP gene. Plus signs above each lane indicate GFP expression in the 

animal detectable by viewing under conventional fluorescence. Of the 56 founder animals 
in this experiment, 45 or 80.4% are transgenic. Of these 45 transgenic animals, 41 or 
91.1% express GFP at detectable levels. Lanes marked "C" are positive plasmid controls. 

Figure 4 shows ubiquitous GFP expression in rats derived from the delivery of 

30 modified lentivirus to single-cell embryos in vitro. Figure 4A shows brightfield (BF) and 

fluorescent images of the paws of newborn rats derived from a FUGW-injected embryo. 
Pup R4, carrying 4 copies of the proviral insert, expresses GFP in the paw, as well as all 
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other tissues and organs examined. A littermate (R3) carrying no transgene is included for 
comparison. Figure 4B shows a Southern blot analysis of provirat insertions in rats 
generated by injection of FUGW lentivirus into the perivitelline space of single-cell 
embryos. Genomic DNA was digested with PstI and hybridized with a GFP+WRE probe. 
5 Plus signs above each lane indicate GFP expression in the animal detectable by direct 

viewing under a fluorescent microscope. 

Figure 5 is a Southern blot analysis of proviral transgene insertions in the founder 
generation of mice generated by incubating denuded embryos in media comprising 
recombinant lentivirus produced with the FUGW viral construct. Genomic DNA from each 

10 animal was digested with PstI and probed with a GFP+WRE sequence. All PstI sites in the 

pro virus are located 5' to the GFP gene. The ratios above the lanes indicate the dilution of 
the virus from 1 x 10 6 pfu/jil. 

Figure 6 shows GFP expression in major tissues and organs of a founder mouse. 
The mouse was perfused intracardially with PBS, pH 7.4, and then 3% paraformaldehyde, 

15 and viewed immediately under a fluorescent dissecting microscope. The particular mouse 

shown was generated by co-incubation of the denuded embryo with the lentiviral 
suspension and contains 8 proviral insertions. A wildtype animal, identically perfused and 
photographed is included for comparison. 

Figure 7 is a Southern blot analysis of proviral transgene insertions in the Fl 

20 progeny of founder transgenic mice, showing that the Fl progeny inherit the proviral 

transgene in a Mendelian fashion. The founder mice were generated by injection of FUGW 
lentivirus into the perivitelline space of single-cell embryos. Genomic DNA from each 
animal was digested with BamHI and probed wiht GFP+WRE sequence. All BamHI sites 
are located 5' to the GFP gene. The first numbered lane in each group is the P0 founder 

25 animal, while the lettered lanes represent progeny resulting from outcrossing that founder 

animal to a wildtype animal. Plus signs above each lane indicate GFP expression in that 
animal detectable by direct viewing of the live animal under a conventional epifluoresence 
microscope. 

Figure 8 shows that transgenic mice give rise to transgenic progeny that express the 
30 transgene. This indicates that the transgene can go through an entire round of 

gametogenesis and development without being silenced. Expression of the transgene was 
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determined based on GFP expression in the newborn pup. The pup imag ed here is 
descended from an animal with 10 proviral insertions. 

Figure 9 is a Southern blot analysis of proviral transgene insertions in the founder 
generation of mice generated using a lentiviral vector containing the myogenin promoter 
dnvng a histone2B-GFP fusion. Embryos were recovered from the uterus at embryonic 
day 11.5 ("El 1.5"). The litter consisted of 6 animals, all of which were transgenic. 
Genomic DNA from each animal was digested with BamHI and probed with a GFP+WRE 
sequence. A BamHI site is located within the histone2B gene, 5' of the GFP sequence in 
the provirus. Plus signs above the lanes indicate positive plasmid controls. Animals 5 and 
6 were positive for tissue-specific GFP expression at embryonic day 1 1.5 when viewed as a 
whole mount under an inverted fluorescent microscope, and animal 5 expressed more 
highly than animal 6. 

Figure 10 shows the GFP expression pattern in an El 1.5 mouse embryo derived 
from the perivitelline space injection of lentivirus carrying a histone2B-GFP fusion 
construct under the control of the myogenin promoter (Yee et al. Genes and Dev. 6:1277- 
1289 (1993)). GFP expression is localized to the somites and can be seen in the emerging 
muscles in the limb buds, eye and jaw. 

Figure 11 shows immunofluorescence with an antibody against GFP in a cross- 
section through an El 1.5 embryo carrying the myogenin promoter driving GFP expression 
Embryos were derived from single-cell zygotes injected with recombinant lentivirus in the 
permtelhne space. Embryos were fixed in 3% paraformaldehyde, cryoprotected in 30% 
sucrose overnight and 30 urn sections were cut on a cryostat. Sections were incubated with 
a polyclonal antibody against GFP and probed with a-rabbit secondary antibody conjugated 
to a rhodamine fluorophore. Images on the left are sections as viewed under a rhodamine 
filter, while images on the right show the nuclear counterstain Hoechst-33342 for each 
corresponding section. The anima. carried 6 proviral insertions of the myogenin-GFP 
construct. Specific staining of somite tissues can be seen, with the exclusion of the stain 
from flanking skin and bone tissues. 

Figure 12 also shows immunofluorescence with an antibody against GFP in cross 
sect,ons of an El 1.5 embryo carrying a myogenin promoter driving H2B-GFP. Lack of 
staining in the viscera is noteworthy. 
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Figure 13 also shows immunofluorescence with an antibody against GFP in cross 
sections of an El 1.5 embryo carrying a myogenin promoter driving H2B-GFP. Specific 
staining of somites on either side of the neural tube can be visualized. 

Figure 14 shows H2B-GFP expression in the extraembryonic tissue of developing 
5 zebra finch. 

Figure 15 shows H2B-GFP expression inside of the zebra finch embryo. 

Figure 16 shows the nucleotide sequence of GFP. 

Figure 17 shows the nucleotide sequence of H2B-GFP. 

Figure 18A shows the nucleotide sequence of HIV NL4.3 flap and 18B shows the 
10 nucleotide sequence of WRE. 

Figure 19A shows the nucleotide sequence of the myogenin promoter and 19B 
shows a partial nucleotide sequence of the Lck promoter. 

Figure 20 shows the nucleotide sequence of the human ubiquitin promoter. 

Figure 21 shows the nucleotide sequence of the HIV-1 flap + ubiquitin + GFP + 
15 WRE construct. 

Detailed Description of the Preferred Embodiment 
Retroviruses are enveloped RNA viruses that are capable of infecting animal cells. 
When a retrovirus infects a cell, its RNA genome is converted into a double-stranded linear 
20 DNA form by reverse transcription. The DNA form is integrated into the host cell genome 

as a pro virus. The present invention is based on the discovery that recombinant retroviruses 
can be used to create transgenic animals. Transgenic animals resulting from the methods of 
the present invention have one or more copies of the transgene of interest integrated in their 
genome. 

25 Previous transgenic technology is not commercially practical in larger animals, such 

as monkeys, dogs, poultry, cows, pigs or sheep. Furthermore, previous transgenic methods 
are not applicable to poultry. Thus, the methods of the present invention will find great 
commercial application, for example in biotechnology and agriculture. The present 
methods may be used to introduce the gene of choice into animals in order to confer upon 

30 them desired attributes. For example, the described methods may be used to confer disease 

resistance. In biotechnology, the ability to rapidly develop large numbers of transgenic 
animals, particular higher order animals such as monkeys, will allow for the analysis of 
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gene function and the evaluation of compounds that potentially modulate gene expression, 
protein function, or are useful in treating a disease or disorder. Two types of assays in 
which the methods of the present invention are particularly useful are gene trap assays and 
large-scale mutagenesis screens, each of which is described below. 

5 

Unless defined otherwise, all technical and scientific terms used herein have the 
same meaning as commonly understood by one of ordinary skill in the art to which this 
invention belongs. Any methods, devices and materials similar or equivalent to those 
described herein can be used in the practice of this invention. 

10 By "transgene" is meant any nucleotide or DNA sequence that is integrated into one 

or more chromosomes of a host cell by human intervention, such as by the methods of the 
present invention. In one embodiment the transgene comprises a "gene of interest." A 
"gene of interest" is a nucleic acid sequence that encodes a protein or other molecule that is 
desirable for integration and/or expression in a host cell. In this embodiment the gene of 

1 5 interest is generally operatively linked to other sequences that are useful for obtaining the 

desired expression of the gene of interest, such as transcriptional regulatory sequences. In 
another embodiment the transgene can be a DNA sequence that is used to mark the 
chromosome where it has integrated. In this situation, the transgene does not have to 
comprise a gene that encodes a protein that can be expressed. This use of the transgene as a 

20 molecular tag has numerous applications, for example for mutagenesis studies as described 

below. 

The term "transgenic" is used herein to describe the property of harboring a 
transgene. For instance, a "transgenic organism" is any animal, including mammals, fish, 
birds and amphibians, in which one or more of the cells of the animal contain nucleic acid 
25 introduced by way of human intervention, such as by the methods described herein. In the 

typical transgenic animal, the transgene causes the cell to express or overexpress a 
recombinant protein. However for some applications, such as the mutagenesis studies 
described below, it is not necessary or desirable for the transgenic organism to express a 
recombinant protein. 

30 The terms "founder," "founder animal" and "founder line" refer to those animals 

that are mature products of the embryos or oocytes to which the transgene was added, i.e. 
those animals that grew from the embryos or oocytes into which DNA was inserted. 
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The terms "progeny" and "progeny of the transgenic animal" refer to any and all 
offspring of every generation subsequent to the originally transformed animal. 

The term "animal" is used in its broadest sense and refers to all animals including 
mammals, birds, fish, reptiles and amphibians. 
5 The term "mammal" refers to all members of the class Mammalia and includes any 

animal classified as a mammal, including humans, domestic and farm animals, and zoo, 
sports or pet animals, such as mouse, rabbit, pig, sheep, goat, cattle and higher primates. 

The term "oocyte" refers to a female gamete cell and includes primary oocytes, 
secondary oocytes and mature, unfertilized ovum. As used herein, the term "egg" when 
10 used in reference to a mammalian egg, means an oocyte surrounded by a zona pellucida. 

The term "zygote" refers to a fertilized ovum. The term "embryo" broadly refers to an 
animal in the early stages of development. 

"Perivitelline space" refers to the space located between the zona pellucida and the 
cell membrane of a mammalian egg or embryonic cell. 
1 5 "Target cell" or "host cell" means a cell that is to be transformed using the methods 

and compositions of the invention. 

"Lentivirus" refers to a genus of retroviruses that are capable of infecting dividing 
and non-dividing cells. Several examples of lentiviruses include HIV (human 
immunodeficiency virus; including HIV type 1 , and HIV type 2), the etiologic agent of the 
20 human acquired immunodeficiency syndrome (AIDS); visna-maedi, which causes 

encephalitis (visna) or pneumonia (maedi) in sheep, the caprine arthritis-encephalitis virus, 
which causes immune deficiency, arthritis, and encephalopathy in goats; equine infectious 
anemia virus, which causes autoimmune hemolytic anemia, and encephalopathy in horses; 
feline immunodeficiency virus (FIV), which causes immune deficiency in cats; bovine 
25 immune deficiency virus (BIV), which causes lymphadenopathy, lymphocytosis, and 

possibly central nervous system infection in cattle; and simian immunodeficiency virus 
(SIV), which cause immune deficiency and encephalopathy in sub-human primates. 

A lentiviral genome is generally organized into a 5' long terminal repeat (LTR), the 
gag gene, the pol gene, the env gene, the accessory genes (nef, vif, vpr, vpu) and a 3' LTR. 
30 The viral LTR is divided into three regions called U3, R and U5. The U3 region contains 

the enhancer and promoter elements. The U5 region contains the polyadenylation signals. 
The R (repeat) region separates the U3 and U5 regions and transcribed sequences of the R 
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region appear at both the 5' and 3' ends of the viral RNA. See, for example, "RNA Viruses- 
A Practical Approach" (Alan J. Cann, Ed., Oxford University Press, (2000)), O Narayan 
and Clements J. Gen. Virology 70:1617-1639 (1989), Fields et al. Fundamental Virology 
Raven Press. (1990), Miyoshi H, Blomer U, Takahashi M, Gage FH, Verma IM J Virol 
72(10):8150-7 (1998), U.S. Patent No. 6,013,516. 

"Virion," "viral particle" and "retroviral particle" are used herein to refer to a single 
varus comprising an RNA genome, pal gene derived proteins, gag gene derived proteins 
and a hp ld b.layer displaying an envelope (glyco)protein. The RNA genome is usually a 
recombinant RNA genome and thus may contain an RNA sequence that is exogenous to the 
natave viral genome. The RNA genome may also comprise a defective endogenous viral 
sequence. 

A "pseudotyped" retrovirus is a retroviral particle having an envelope protein that is 
from a virus other than the virus from which the RNA genome is derived. The envelope 
protem may be from a different retrovirus or from a non-retroviral virus. A preferred 
envelope protein is the vesicular stomatitius virus G (VSV G) protein. However to 
ehmmate the possibility of human infection, viruses can alternatively be pseudotyped with 
ecotrop lc envelope protein that limit infection to a specific species, such as mice or birds. 

The term "provirus" is used to refer to a duplex DNA sequence present in a 
eukaryot.c chromosome that corresponds to the genome of an RNA retrovirus The 
proves may be transmitted from one cell generation to the next without causing lysis or 
destruction of the host cell. 

A "self-inactivating 3' LTR" is a 3' long terminal repeat (LTR) that contains a 
mutanon, substitution or deletion that prevents the LTR sequences from driving expression 
of a downstream gene. A copy of the U3 region from the 3' LTR acts as a template for the 
generation of both LTR's in the integrated provirus. Thus, when the 3' LTR with an 
mactwatmg deletion or mutation integrates as the 5' LTR of the provirus, no transcription 
from the 5' LTR is possible. This eliminates competition between the viral 
enhancer/promoter and any internal enhancer/promoter. Self-inactivating 3' LTRs are 
descnbed, for example, in Zufferey et al. J. Virol. 72:9873-9880 (1998), Miyoshi et al. J. 
Virol. 72:8150-8157 and Iwakuma etal. Virology 261:120-132 (1999). 
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In one aspect of the invention, a recombinant retrovirus is used to deliver a 
transgene of interest to a cell, preferably an oocyte or an embryonic cell, more preferably a 
one-cell embryo. The transgene, and any associated genetic elements, are thus integrated 
into the genome of the host cell as a provirus. The cell may then be allowed to develop into 

5 a transgenic animal. 

The recombinant retrovirus used to deliver the transgene is preferably a modified 

lentivirus, and thus is able to infect both dividing and non-dividing cells. The recombinant 

retrovirus preferably comprises a modified lentiviral genome that includes the transgene. 

Further, the modified lentiviral genome preferably lacks endogenous genes for proteins 
10 required for viral replication, thus preventing replication in the transgenic animal. The 

required proteins are provided in trans in the packaging cell line during production of the 

recombinant retrovirus, as described below. 

In the preferred embodiment the transgene is incorporated into a viral construct that 

comprises an intact retroviral 5' LTR and a self-inactivating 3' LTR. The viral construct is 
1 5 preferably introduced into a packaging cell line that packages viral genomic RNA based on 

the viral construct into viral particles with the desired host specificity. Viral particles are 

collected and allowed to infect the host cell. Each of these aspects is described in detail 

below. 

20 The Viral Construct 

The viral construct is a nucleotide sequence that comprises sequences necessary for 
the production of recombinant retrovirus in a packaging cell. In one embodiment the viral 
construct additionally comprises genetic elements that allow for the desired expression of a 
gene of interest in the host. 

25 Generation of the viral construct can be accomplished using any suitable genetic 

engineering techniques well known in the art, including, without limitation, the standard 
techniques of restriction endonuclease digestion, ligation, transformation, plasmid 
purification, and DNA sequencing, for example as described in Sambrook et al. (Molecular 
Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press, N.Y. (1989)), 

30 Coffin et al. (Retroviruses. Cold Spring Harbor Laboratory Press, N.Y. (1997)) and "RNA 

Viruses: A Practical Approach" (Alan J. Cann, Ed., Oxford University Press, (2000)). 
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The viral construct may incorporate sequences from the genome of any known 
organism. The sequences may be incorporated in their native form or may be modified in 
any way. For example, the sequences may comprise insertions, deletions or substitutions. 
In the preferred embodiment the viral construct comprises sequences from a lentivirus 
genome, such as the HIV genome or the SIV genome. 

The viral construct preferably comprises sequences from the 5' and 3' LTRs of a 
lentivirus. More preferably the viral construct comprises the R and U5 sequences from the 
5' LTR of a lentivirus and an inactivated or self-inactivating 3' LTR from a lentivirus. The 
LTR sequences may be LTR sequences from any lentivirus from any species. For example, 
they may be LTR sequences from HIV, SIV, FIV or BIV. Preferably the LTR sequences 
are HIV LTR sequences. 

The viral construct preferably comprises an inactivated or self-inactivating 3' LTR. 
The 3' LTR may be made self-inactivating by any method known in the art. In the 
preferred embodiment the U3 element of the 3' LTR contains a deletion of its enhancer 
sequence, preferably the TATA box, Spl and NF-kappa B sites. As a result of the self- 
inactivating 3' LTR, the provirus that is integrated into the host cell genome will comprise 
an inactivated 5* LTR. 

Optionally, the U3 sequence from the lentiviral 5' LTR may be replaced with a 
promoter sequence in the viral construct. This may increase the titer of virus recovered 
from the packaging cell line. An enhancer sequence may also be included. Any 
enhancer/promoter combination that increases expression of the viral RNA genome in the 
packaging cell line may be used. In the preferred embodiment the CMV enhancer/promoter 
sequence is used. 

In one embodiment the viral construct comprises a gene that encodes a protein that 
is desirably expressed in one or more cells of a transgenic animal. Preferably the gene of 
interest is located between the 5' LTR and 3' LTR sequences. Further, the gene of interest 
is preferably in a functional relationship with other genetic elements, for example 
transcription regulatory sequences such as promoters and/or enhancers, to regulate 
expression of the gene of interest in a particular manner once the transgene is incorporated 
into the host genome. In certain embodiments, the useful transcriptional regulatory 
sequences are those that are highly regulated with respect to activity, both temporally and 
spatially. 
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Preferably the gene of interest is in a functional relationship with internal 
promoter/enhancer regulatory sequences. An "internal" promoter/enhancer is one that is 
located between the 5* LTR and the 3' LTR sequences in the viral construct and is operably 
linked to the gene that is desirably expressed. 
5 The internal promoter/enhancer may be any promoter, enhancer or 

promoter/enhancer combination known to increase expression of a gene with which it is in 
a functional relationship. A "functional relationship" and "operably linked" mean, without 
limitation, that the gene is in the correct location and orientation with respect to the 
promoter and/or enhancer that expression of the gene will be affected when the promoter 

1 0 and/or enhancer is contacted with the appropriate molecules. 

The internal promoter/enhancer is preferably selected based on the desired 
expression pattern of the gene of interest and the specific properties of known 
promoters/enhancers. Thus, the internal promoter may be a constitutive promoter. Non- 
limiting examples of constitutive promoters that may be used include the promoter for 

15 ubiquitin, CMV (Karasuyama et al J. Exp. Med. 169:13 (1989), B-actin (Gunning et al. 

Proc. Natl. Acad. Sci. USA 84:4831-4835 (1987) and pgk (see, for example, Adra et al. 
Gene 60:65-74 (1987), Singer-Sam et al. Gene 32:409-417 (1984) and Dobson et al. 
Nucleic Acids Res. 10:2635-2637 (1982)). Alternatively, the promoter may be a tissue 
specific promoter. Several non-limiting examples of tissue specific promoters that may be 

20 used include Ick (see, for example, Garvin et al. Mol. Cell Biol. 8:3058-3064 (1988) and 

Takadera et al. Mol. Cell Biol. 9:2173-2180 (1989)), myogenin (Yee et al. Genes and 
Development 7:1277-1289 (1993), and thyl (Gundersen et al. Gene 113:207-214 (1992). 
In addition, promoters may be selected to allow for inducible expression of the transgene. 
A number of systems for inducible expression using such a promoter are known in the art, 

25 including the tetracycline responsive system and the lac operator-repressor system. It is 

also contemplated that a combination of promoters may be used to obtain the desired 
expression of the gene of interest. The skilled artisan will be able to select a promoter 
based on the desired expression pattern of the gene in the resulting transgenic animal. 

An internal enhancer may also be present in the viral construct to increase 

30 expression of the gene of interest. For example the CMV enhancer (Karasuyama et al J. 

Exp. Med. 169:13 (1989) may be used in combination with the chicken 6-actin promoter. 
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Again, one of skill in the art will be able to select the appropriate enhancer based on the 
desired expression pattern. 

The gene of interest is not limited in any way and includes any gene that the skilled 
practitioner desires to have integrated and/or expressed in a transgenic animal. For 
example, the gene of interest may be one that encodes a protein that modifies a physical 
characteristic of the transgenic animal, such as a protein that modifies size, growth, or 
tissue composition. In another example the gene of interest may encode a protein of 
commercial value that may be harvested from the transgenic animal. 

In addition, more than one gene of interest may be placed in functional relationship 
with the internal promoter. For example a gene encoding a marker protein may be placed 
after the primary gene of interest to allow for identification of cells that are expressing the 
desired protein. In one embodiment a fluorescent marker protein, preferably green 
fluorescent protein (GFP), is incorporated into the construct along with the gene of interest. 
If a second reporter gene is included, an internal ribosomal entry site (IRES) sequence is 
also preferably included. The IRES sequence may facilitate the expression of the reporter 
gene 

The viral construct may also contain additional genetic elements. The types of 
elements that may be included in the construct are not limited in any way and will be 
chosen by the skilled practitioner to achieve a particular result. For example, a signal that 
facilitates nuclear entry of the viral genome in the target cell may be included. An example 
of such a signal is the HTV-1 flap signal. 

Further, elements may be included that facilitate the characterization of the provirus 
integration site in the genome of the animal. For example, a tRNA amber suppressor 
sequence may be included in the construct. 

In addition, the construct may contain one or more genetic elements designed to 
enhance expression of the gene of interest. For example, a woodchuck hepatitis virus 
responsive element (WRE) may be placed into the construct (Zufferey et al. J. Virol. 
74:3668-368 1 ( 1 999); Deglon et al. Hum. Gene Then 1 1 : 1 79- 1 90 (2000)). 

A chicken p-globin insulator may also be included in the viral construct. This 
element has been shown to reduce the chance of silencing the integrated provirus in the 
transgenic animal due to methylation and heterochromatinization effects. In addition, the 
insulator may shield the internal enhancer, promoter and exogenous gene from positive or 
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negative positional effects from surrounding DNA at the integration site on the 
chromosome. 

Any additional genetic elements are preferably inserted 3' of the gene of interest. 

In a specific embodiment, the viral vector comprises: a cytomegalovirus (CMV) 
5 enhancer/promoter sequence; the R and U5 sequences from the HTV 5' LTR; the HIV-1 

flap signal; an internal enhancer; an internal promoter; a gene of interest; the woodchuck 
hepatitis virus responsive element; a tRNA amber suppressor sequence; a U3 element with 
a deletion of its enhancer sequence; the chicken P-globin insulator; and the R and U5 
sequences of the 3' HIV LTR. 
10 The viral construct is preferably cloned into a plasmid that may be transfected into a 

packaging cell line. The preferred plasmid preferably comprises sequences useful for 
replication of the plasmid in bacteria. 

Production of Virus 

1 5 Any method known in the art may be used to produce infectious retroviral particles 

whose genome comprises an RNA copy of the viral construct described above. 

Preferably, the viral construct is introduced into a packaging cell line. The 
packaging cell line provides the viral proteins that are required in trans for the packaging of 
the viral genomic RNA into viral particles. The packaging cell line may be any cell line 

20 that is capable of expressing retroviral proteins. Preferred packaging cell lines include 293 

(ATCC CCL X), HeLa (ATCC CCL 2), D17 (ATCC CCL 183), MDCK (ATCC CCL 34), 
BHK (ATCC CCL- 10) and Cf2Th (ATCC CRL 1430). The most preferable cell line is the 
293 cell line. 

The packaging cell line may stably express the necessary viral proteins. Such a 
25 packaging cell line is described, for example, in U.S. Patent No. 6,2 1 8,1 8 1 . Alternatively a 

packaging cell line may be transiently transfected with plasmids comprising nucleic acid 
that encodes the necessary viral proteins. 

In one embodiment a packaging cell line that stably expresses the viral proteins 
required for packaging the RNA genome is transfected with a plasmid comprising the viral 
30 construct described above. 

In another embodiment a packaging cell line that does not stably express the 
necessary viral proteins is co-transfected with two or more plasmids essentially as described 
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in Yee et al. {Methods Cell. Biol 43A, 99-112 (1994)). One of the plasmids comprises the 
viral construct comprising the transgene. The other plasmid(s) comprises nucleic acid 
encoding the proteins necessary to allow the cells to produce functional virus that is able to 
infect the desired host cell. 

The packaging cell line may not express envelope gene products. In this case the 
packaging cell line will package the viral genome into particles that lack an envelope 
protein. As the envelope protein is responsible, in part, for the host range of the viral 
particles, the viruses are preferably pseudotyped. Thus the packaging cell line is preferably 
transfected with a plasmid comprising sequences encoding a membrane-associated protein 
that will permit entry of the virus into a host cell. One of skill in the art will be able to 
choose the appropriate pseudotype for the host cell that is to be used. For example, in one 
embodiment the viruses are pseudotyped with the vesicular stomatitis virus envelope 
glycoprotein (VSVg). In another embodiment, a mutant ecotropic envelope protein is used, 
such as the ecotropic envelope protein 4.17 (Powell et al. Nature Biotechnology 
18(12):1279-1282 (2000)). In addition to conferring a specific host range the pseudotype 
may permit the virus to be concentrated to a very high titer and may enhance safety by 
preventing the virus from infecting undesired cell types. 

In the preferred embodiment a packaging cell line that does not stably express viral 
proteins is transfected with the viral construct, a second vector comprising the HIV-1 
packaging vector with the env, nef 9 5'LTR, 3'LTR and vpu sequences deleted, and a third 
vector encoding an envelope glycoprotein. Preferably the third vector encodes the VSVg 
envelope glycoprotein. 

The recombinant virus is then preferably purified from the packaging cells, titered 
and diluted to the desired concentration. 

Transgenic Animals 

In order to make transgenic animals, an oocyte or one or more embryonic cells are 
infected with the recombinant virus produced as described above. One of skill in the art 
will recognize that the method of infection and the treatment of the cell following infection 
will depend upon the type of animal from which the cell is obtained. For example, 
mammalian cells are preferably implanted in a pseudopregnant female following infection 



-16- 



WO 03/022228 



PCT/US02/29157 



while for the generation of transgenic birds or fish, the virus is preferably delivered to a laid 

egg and thus implantation is not required. 

While early methods of making transgenic animals required the cells to be rapidly 

dividing, there is no such requirement in the methods of the present invention. Thus the 
5 cell may be contacted at any point in development. In the preferred embodiment, a zygote 

is contacted with the recombinant virus. 

The cells to be infected with the virus may be obtained by any method known in the 

art and appropriate for the specific species in which it is desired to make a transgenic 

animal. For example, the recovery of fertilized mouse oocytes is described in Hogan et al. 
10 (Manipulating the Mouse Embryo: A Laboratory Manual. 2 nd ed. Cold Spring Harbor 

Laboratory Press, NY (1994)). A method for obtaining fertilized rat oocytes is described in 

Armstrong et ah (Biol. Reprod. 39,511-518 (1998)). 

It is not necessary that the cells be contacted after fertilization. In one embodiment, 

the virus is delivered to unfertilized ova. Development may then be initialized, for example 
15 by in vitro fertilization. 

Delivery of the Virus 

The virus may be delivered to the cell in any way that allows the virus to infect the 
cell. Preferably the virus is allowed to contact the cell membrane. Two preferred methods 
20 of delivering the virus to mammalian cells, injection and direct contact, are described 

below. 

Injection 

In a first embodiment the virus is injected into the perivitelline space between the 
zona pellucida and the cell membrane of a single-cell zygote. Preferably less than 50 

25 picoliters of viral suspension is injected, more preferably less than 25 picoliters and even 

more preferably about 10 picoliters. 

The virus is preferably present in a viral suspension and may be injected by any 
method known in the art. The viral suspension is preferably injected through a hydraulic 
injector. More preferably a glass micropipette is used to inject the virus. In one 

30 embodiment a micropipette is prepared by pulling borosilicate glass capillary on a pipette 

puller. The tip is preferably opened and beveled to approximately 10 \im. The lentiviral 
suspension may be loaded into the micropipette from the tip using gentle negative pressure. 
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In one embodiment the cell is stabilized with a holding pipette mounted on a 
micromanipulator, such as by gentle negative pressure against a fire-polished pipette, and a 
second micromanipulator is used to direct the tip of a micropipette into the space between 
the zona pellucida and the cell membrane, where the virus is injected. 
Direct contact 

In another embodiment the zona pellucida is removed from the cell to produce a 
denuded embryo and the cell membrane is contacted with the virus. The zona pellucida 
may be removed by any method known in the art. Preferably it is removed by enzymatic 
treatment. For example, treatment with pronase may be used to remove the zona pellucida 
while the cell membrane is kept intact. Alternatively, the cell may be placed in media at pH 
at which the zona pellucida dissolves while the cell membrane remains intact. For example 
the cell may be incubated in an acidic Tyrode's solution at room temperature for several 
minutes. Once the zona pellucida is removed, any method that allows for the virus to 
contact the cell membrane may be used. Preferably, the cell is incubated in a solution 
containing the virus. Even more preferably, the solution is media that facilitates survival of 
the cell. 

In this embodiment, the cells are preferably contacted with the virus in culture 
plates. The virus may be suspended in media and added to the wells of a multi-well culture 
plate. The cells may then be plated in the individual wells. The media containing the virus 
may be added prior to the plating of the cells or after the cells have been plated. Preferably 
individual cells are incubated in approximately 10 pi of media. However, any amount of 
media may be used as long as an appropriate concentration of virus in the media is 
maintained such that infection of the host cell occurs. 

The cells are preferably incubated with the virus for a sufficient amount of time to 
allow the virus to infect the cells. Preferably the cells are incubated with virus for at least 1 
hour, more preferably at least 5 hours and even more preferably at least 10 hours. 

Both the injection and direct contact embodiments may advantageously be scaled up 
to allow high throughput transgenesis. - Because of the relative simplicity of the injection 
technique, it is possible to inject many embryos rapidly. For example, it is possible to 
inject more than 200 fertilized oocytes in less than one hour. With regard to the direct 
contact embodiment, any number of embryos may be incubated in the viral suspension 
simultaneously. This may be accomplished, for example, by plating the desired number of 
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single-cell zygotes in multi-well tissue culture plates containing the virus suspended in 
media appropriate for the survival and growth of the cells. 

In both embodiments, any concentration of virus that is sufficient to infect the cell 
may be used. Preferably the concentration is at least 1 pfu/^il, more preferably at least 10 
5 pfu/^1, even more preferably at least 400 pfli/^1 and even more preferably at least 1 x 10 4 

pfu/^xl. 

Following infection with the virus, the cells are preferably implanted in an animal. 

More preferably cells infected with the virus are implanted in pseudo-pregnant animals of 

the same species from which the infected cells were obtained. Methods of creating pseudo- 
10 pregnancy in animals and implanting embryos are well known in the art and are described, 

for example, in Hogan et al. (Manipulating the Mouse Embryo: A Laboratory Manual. 2 nd 

ed. Cold Spring Harbor Laboratory Press, NY (1994)). 

In the preferred embodiment early stage embryos (approximately 0 - 2.5 days p.c.) 

still with an intact zona pellucida are transferred to the oviduct of timed pseudopregnant 
15 female (preferably 0.5 days p.c), while embryos that have reached the blastocyst stage are 

transferred to the uterus of timed pseudopregnant females (preferably 2.5 days p.c). 

Denuded embryos are preferably cultured in vitro until they reach the morula or blastocyst 

stage (48 to 72 hours in culture), and are then implanted into appropriately timed 

pseudopregnant females. 

20 The embryos and resulting animals may be analyzed, for example for integration of 

the transgene, the number of copies of the transgene that integrated, the location of the 
integration, the ability to transmit the transgene to progeny and expression of the transgene. 
Such analysis may be carried out at any time and may be carried out by any methods known 
in the art. Standard techniques are described, for example, in Hogan et al. (supra). 

25 The methods of infecting cells disclosed above do not depend upon species-specific 

characteristics of the cells. As a result, they are readily extended to all mammalian species. 

Initial experiments with mice indicate that of those animals that develop to full 
term, 80-90% carried at least one copy of the transgene and that, of these, approximately 
85% express the gene of interest. Of the transgenic animals about 25% carry only 1 or 2 

30 copies of the transgene. The highest number of proviral insertions observed was about 30. 

Of the animals that carried only 1 or 2 copies of the transgene, about 80% expressed the 
gene of interest. 
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As discussed above, the modified retrovirus can be pseudotyped to confer upon it a 
broad host range. One of skill in the art would also be aware of appropriate internal 
promoters to achieve the desired expression of a gene of interest in a particular animal 
5 species. Thus, one of skill in the art will be able to modify the method of infecting cells to 

create transgenic animals of any species. 

For example, transgenic birds are created by delivering a modified retrovirus, as 
described above, to the primordial germ cells of early stage avian embryos. In one 
embodiment, freshly laid eggs are obtained and placed in a temperature controlled, 

10 humidified incubator. Preferably, the embryonic blastodisc in the egg is gradually rotated 

to lie on top of the yolk. This may be accomplished by any method known in the art, such 
as by gently rocking the egg regularly, preferably every 15 minutes. Approximately 36 
hours later, the modified retrovirus is delivered into the space between the embryonic disk 
and the perivitelline membrane. Preferably about 50 nL of viral solution is delivered, more 

1 5 preferably about 100 nL of viral solution is delivered, and even more preferably about 200 

nL of viral solution is delivered. The viral solution may be delivered by any method known 
in the art for delivering compositions to the inside of an egg. In the preferred embodiment a 
window is opened in the shell, the viral solution is injected through the window and the 
shell window is closed. The eggs are preferably incubated until hatching. The eggs will 

20 hatch after approximately 20 days, depending upon the particular avian species from which 

they are obtained. Hatched chicks are preferably raised to sexual maturity and mated. The 
transgenic offspring of the founder animals may be identified by any method known in the 
art, such as Southern blot, PCR and expression analysis. 

In another embodiment, transgenic fish are created by delivering the modified 

25 retrovirus, described above, to single-cell fish embryos. Fertilized fish eggs are collected 

by any method known in the art. The modified retrovirus is then preferably delivered to the 
space between the chorion and the cell membrane. This may be accomplished, for 
example, by loading the modified retrovirus in solution into a glass pipette. The pipette 
may then be used to pierce the chorion membrane and deliver the viral suspension. 

30 Preferably about 50 nL of viral solution is delivered, more preferably about 100 nL of viral 

solution is delivered, and even more preferably about 200 nL of viral solution is delivered. 
Injected embryos are preferably returned to a temperature-controlled water tank and 
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allowed to mature. At sexual maturity the founder fish are preferably mated and their 
progeny analyzed for the presence of the transgene by any method known in the art. 

As mentioned above, the methods of the present invention will also prove useful in 
techniques for identifying genes that are involved in specific biological processes, such as 
gene trap assays and large-scale mutagenesis screens. 

Gene trap experiments allow the identification and cloning of a gene that is 
expressed in a particular tissue or cell type, and/or at a particular time, based solely on its 
pattern of expression. Gene trapping relies on the capture of the splicing donor of an 
mRNA by ectopically inserting a downstream splice acceptor, in this case, carried within an 
integrated provirus. Gene trapping has been successfully used in several model systems, 
including the fruit fly Drosophila, mammalian cells in culture, and mouse ES cells (which 
have the advantage of being able to be used to derive mice afterwards for further analysis). 
Gene trapping in cell culture has the advantage of being fast and inexpensive, but is limited 
by the inability of the cells to differentiate into specific cell types. Thus, gene trapping 
experiments in mammalian cell lines in culture usually yield only housekeeping genes 
expressed non-specifically in any mammalian cell, or cell-specific genes that are only 
expressed by the particular cell line in vitro. Because cell lines often show incomplete 
degrees of differentiation, the complement of tissue-specific genes expressed by these cells 
is limited. Furthermore, there are many tissues for which representative cell lines do not 
exist. 

The use of the above-described recombinant lentiviral vectors for the purposes of 
gene trapping is facilitated by the self-inactivating mutation in the U3 enhancer element of 
the 3' LTR. The lack of transcriptional activity from the integrated 5' LTR ensures that any 
transcription of a reporter element in the provirus is driven by upstream regulatory 
sequences to the insertion that have been "trapped," rather than from the viral promoter 
itself. 

Thus, one embodiment of the present invention concerns a method of identifying 
genes that are expressed in a particular tissue and/or at a particular time during the 
development of an organism. A self-inactivating viral construct is made that preferably 
comprises a splice acceptor sequence and a sequence encoding a reporter gene. Modified 
retroviral particles are made using the viral construct as described above and used to infect 
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embryonic cells. Tissues from the founder animal or its progeny are analyzed for the 
presence of the reporter to determine the temporal and/or spatial pattern of expression. 
Messenger RNA is collected from the tissues of the animals that express the reporter 
protein in the time and place of interest. The "trapped" gene may then be identified by any 
method known in the art. Preferably, oligonucleotides that are complementary to the 
reporter gene may then used in a reverse transcription reaction to produce a cDNA that 
contains the sequences of the trapped gene that flank the provirus. The cDNA may then be 
cloned into a plasmid from which the gene may be identified by nucleotide sequencing. 

Gene trap experiments are well known in the art and the skilled artisan would be 
able to choose the reporter gene, splice acceptor sequence and any other genetic elements 
that would be useful to include in the viral construct based on the specific analysis that they 
have undertaken. In addition, by modifying the viral constructs, the technique can be used 
to trap promoter or enhancer sequences. For promoter trap experiments, the reporter gene 
lacks any transcriptional regulatory elements, and is only expressed when the virus 
integrates next to an active promoter. For the enhancer trap, the reporter gene is positioned 
downstream of a minimal promoter that lacks transcriptional activity, and is only expressed 
when the virus integrates next to an active enhancer. 

Another important paradigm by which biologists study gene function is to disrupt 
the function of an endogenous gene and, from the mutant phenotype that results, deduce the 
normal role of that gene in the organism. One way of isolating genes that are important in a 
particular biological process under study is to perform large-scale mutagenesis to generate 
animals that are phenotypically mutant in that process and then to isolate the gene that is 
disrupted in the mutant animal and that is thus responsible for the mutant phenotype. In 
most such experiments, either radiation or chemicals have been used to induce deletions or 
nonsense mutations. However, the genes carrying mutations induced by radiation or 
chemicals are difficult to isolate because no handle is available with which to clone the 
gene. Rather, these mutations must be identified by positional cloning, a slow and 
painstaking process in which the mutation of interest is systematically mapped relative to 
known genetic markers in the genome, with the goal of gradually narrowing down and 
pinpointing the locus of the mutated gene. 

In contrast, a powerful technique that has been used successfully in the fruit fly 
Drosophila melanogaster is that of insertional mutagenesis, in which genes are disrupted 
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when an exogenous piece of DNA is inserted within the coding sequence of the gene. The 
great advantage of insertional mutagenesis is that, because the sequence of the exogenous 
disrupting DNA is known, one can directly clone out that piece of DNA and the flanking 
sequence that corresponds to the gene of interest that has been disrupted. Thus, in contrast 
to traditional positional cloning strategies used in chemical mutagenesis which may take up 
to 3 years after the isolation of the mutant, identification and characterization of the 
mutated gene of interest in an insertional mutagenesis strategy is reduced to just a week or 
so. The main limitation to the application of insertional mutagenesis in organisms other 
than the fruit fly is the lack of a DNA element, such as the transposon used in Drosophila, 
that is able to stably integrate and mark its position in the genome at the germline or one- 
cell embryo stage. 

The lentiviral vectors described above can be effective tools for large-scale 
mutagenesis to identify genes involved in specific biological processes. The modified 
lentiviruses of the present invention are easily delivered to the germline, and pseudotyping 
of the viruses with an envelope glycoprotein, such as VSVg, allows the concentration of the 
virus to extremely high titers. Thus in one embodiment mutagenesis is achieved by 
delivery of the modified retrovirus to the cell membrane of embryonic cells. 

The ability of transgenic animals made by the methods of the present invention to 
express a gene of interest at high levels suggests that the integrated provfruses are not 
silenced by methylation. Previous mutagenesis screens using MoLV-based retroviruses 
have been limited by the observation that, in addition to the provirus, flanking genomic 
sequences are frequently found to be methylated and inactivated. This methylation 
complicates the analysis because it becomes difficult to distinguish whether the mutant 
phenotype is due to the disruption of the gene into which the provirus has inserted, or due 
to the inactivation of any one of several surrounding genes by methylation. 

By delivering modified lentiviruses to embryos according to the methods of the 
present invention, insertional mutagenesis strategies can be applied to any animal species, 
including model genetic organisms such asXenopus, zebrafish, mouse, rat, and zebra finch. 
Early-stage embryos, consisting of several cells, will preferably be targeted because the 
resulting mosaicism increases the number of unique mutagenic events that can be screened. 

The modified lentiviruses integrate randomly into the genome of the target zygotes, 
including that of the germ cells. Thus, some proportion will disrupt coding sequences. The 
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emtwyos are preferably raised to sexual maturity, mated, and the progeny are screened for 
mutant phenotypes of interest. Once a mutant is identified, selective breeding using 
standard methods is preferably used to isolate the particular insertion(s) responsible for the 
phenotype. 

Once a mutant line is established, the mutated locus is preferably identified using 
the provirus as a handle for cloning. In one embodiment, an origin of replication and 
antibiotic resistance gene is included in the viral construct. In this embodiment, genomic 
DNA from the mutant is preferably isolated, randomly cleaved with an appropriate 
restriction enzyme, and the linear fragments circularized by ligation. The ligation mixture 
is then transformed into bacterial cells and plated on antibiotic plates. The plasmid DNA 
from any colonies that arise is isolated and preferably used as a template for inverse PCR 
with oppositely oriented, adjacent primers complementary to sequences in the provirus. 
The amplified DNA molecule(s) is then sequenced to acquire the flanking regions to the 
integration site, corresponding to the gene(s) mutated. 

In another embodiment, inclusion of the tRNA amber suppressor sequence in the 
provirus allows for the rapid generation of genomic libraries containing the flanking regions 
of the integration loci, representing the disrupted gene(s). Once these flanking regions are 
sequenced, they can be compared against the genomic sequence database for that animal to 
determine candidate gene(s) of interest. 

The following examples are offered for illustrative purposes only, and are not 
intended to limit the scope of the present invention in any way. Indeed, various 
modifications of the invention in addition to those shown and described herein will become 
apparent to those skilled in the art from the foregoing description and fall within the scope 
of the appended claims. 

All patent and literature references cited in the present specification are hereby 
incorporated by reference in their entirety. 

Example 1 

Transgenic mice were generated that expressed a heterologous protein, green 
fluorescent protein (GFP). GFP expression was controlled by manipulating the genetic 
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elements in the viral construct employed to create the transgenic mice. For example, a viral 
construct, FUGW, comprising a ubiquitous promoter was used to produce transgenic mice 
that expressed GFP in every cell. Inclusion of a nuclear localization signal produced 
transgenic mice that had GFP localized in the nucleus of their cells. A viral construct with 
5 a lymphocyte specific promoter produced mice that expressed GFP in lymphocytes, while a 

viral construct with a muscle specific promoter produced mice that specifically expressed 
GFP in muscle cells. 

A. Viral Constructs 

10 A viral construct according to the present invention was created using the HR'CS-G 

plasmid (Miyoshi H, Blomer U, Takahashi M, Gage FH, Verma DVL J Virol 72(10):8150-7 
(1998)). This plasmid is based on the HIV-1 HXB2 proviral DNA (see U.S. Patent No. 
6,013,516). 

15 1. Generation of a vector expressing GFP from a ubiquituous promoter, 

FUGW 

The HIV-1 flap sequence (SEQ ID NO: 1) was inserted into the HR'CS-G vector. 

A 147 base pair sequence containing the flap region (Zennou, V., Petit, C, Guetard, D., 

Nerhbass, U., Montagnier, L., Charneau, P. Cell 101(2),173-185 (2000)) was PCR 
20 amplified from a plasmid encoding the genome of HIV NLA4.3. The 5* PCR primer 

encoded BgUI and Pad sites. The 3' PCR primer contained the BamHI site. The resulting 

PCR product was digested with Bgin and BamHI enzymes and inserted into the BamHI site 

of the HR'CS-G vector. The resulting plasmid was called Hflap. 

The ubiquitin promoter (SEQ ID NO: 2) was then inserted into the Hflap plasmid. 
25 The 1.2 Kb sequence encoding the human polyubiquitin C promoter was excised with Bgin 

and BamHI enzymes and inserted into the BamHI site of Hflap. The resulting plasmid was 

called HflapUbi. 

A multi-cloning site was then inserted into HflapUbi. Two oligonucleotides were 
designed that encoded the following restriction sites: BamHI Hpal Xhol AscI EcoRI Bgin. 
30 The oligos were hybridized and inserted into the BamHI site of HflapUbi. The resulting 

plasmid was called FUMCS. 

A nucleic acid sequence encoding GFP (SEQ ED NO: 3) was then inserted into 
HflapUbi. The 700 base pair sequence of GFP was digested with BamHI and Xhol and 
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inserted into the Xhol site of HflapUbi, generating HflapUbiG. The resulting plasmid was 
called FUG. 

The woodchuck hepatitis virus regulator element (WRE; SEQ ID NO: 4) was then 
inserted into HflapUbiG. The 500 bp sequence of WRE (Zufferey, R, Donello, J.E., 
Trono, D., Hope T.J. (1999). J. Virol 73(4), 2886-92) was excised with Sail and Xhol and 
inserted into the Xhol site of HflapUbiG, generating HflapUbiGWRE (SEQ ID NO: 8). 
The resulting plasmid was called FUGW. A map of the FUGW viral vector is presented in 
Figure 1A. 

2 . feneration of « vector excret ing nuclear-localized GFP from a 
nhi miitiious promote r., FUH2BGW 

m order to get specific nuclear localization of GFP, the histone 2B-GFP fusion 

sequence H2BGFP was cloned into FUGW. The histone 2B-GFP sequence (SEQ ID NO: 
5) was digested with Sail and Notl. Both sites were blunted with T4 DNA polymerase and 
inserted into the Hpal site of FUMCS. The resulting plasmid was called FUH2BGW. 

3 . feneration of « vector expres sing GFP from a lymph ocvte-specific 

promoter. F IckGW . , , /ccri to 

To achieve lymphocyte specific GFP expression the murine lck promoter (SEQ ID 

NO: 6) was cloned into the FUGW vector. The ubiquitin promoter from FUGW was 

removed by excising with Pad and BamHl. The Pad site was blunted using T4 DNA 

polymerase and the lck promoter was excised with Spel and BamHI. The Spel site was 

blunted using T4 DNA polymerase. The lck promoter was the inserted into the Pad and 

BamHI sites of FUGW. The resulting plasmid was called FIckGW. 

4 . feneration of a vector expr essing GFP fr»™ » muscle-specific 
promoter. Fm voH2BGW 

To achieve specific expression of GFP in the muscle of transgenic mice the 

myogenin promoter (SEQ ID NO: 7) was cloned into the FUGW construct. The mouse 
myogenin promoter was PCR amplified from a mouse genomic BAC. The 5' PCR primer 
encoded a Pad site, and the 3' PCR primer contained an Xbal site. The PCR product was 
digested with Pad and Xbal. The ubiquitin promoter was removed from the FUH2BGW 
vector by cutting with Pad and Xbal. The Pad-BamHI digested PCR product encoding the 
myogenin promoter was inserted into the Pad and Xbal sites of the FUH2BGW vector. 
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The constructs described above were then used to prepare recombinant lentivirus. 
Briefly, replication-incompetent viral vectors, based on the human immunodeficiency 
virus- 1 (HIV-1), were pseudotyped with the vesicular stomatitis virus envelope 
5 glycoprotein (VSVg), permitting the virus to be concentrated to very high titers and 

conferring upon the virus a broad host range. Pseudotyped lentiviruses were produced 
essentially as described in detail in Yee, J.K., Friedmann, T. & Burns, J.C. (1994). 
Methods Cell Biol 43, 99-1 12; Burns, J.C, Friedmann, T., Driever, W., Burrascano, M., 
and Yee, J.K. (1993). Proc. Natl Acad. Sci. USA. 90, 8033-8037; and Yee, J.K., 

10 Miyanohara, A., LaPorte, P., Bouic, K., Burns, J.C, and Friedmann, T. (1994). Proc. Natl. 

Acad. Sci. USA. 91, 9564-9568. Briefly, human fibroblasts 293 cells were transfected with 
calcium phosphate/DNA coprecipitates of the following plasmids, as described in Gorman, 
C, Padmanabhan, R. and Howard, B.H. (1983). Science. 221,551-553: 

The viral transfer vector described above with self-inactivating LTR; 

15 the HIV-1 packaging vector A8.9 (Zufferey, R., Nagy, D., Mandel, R.J., Naldini, L., 

and Trono, D. (1997). Nat. Biotechnol 15(9), 871-875; Naldini, L., Blomer, U., Gallay, 
P., Ory, D., Mulligan, R., Gage, F.H., Verman, I.M., and Trono, D. (1996). Science. 
272(5259), 263-67) with env. vpr, vpu, vif nef 9 5* LTR, 3' LTR, and \|/ sequences deleted; 
and 

20 the VSVg envelope glycoprotein. 

Viral supernatant was harvested 60 hours post-transfection, subjected to low-speed 
centrifiigation to remove cell debris, filtered through a 0.45 jam nitrocellulose membrane, 
spun at 25,000 rpm for 1.5 hours to concentrate, and resuspended in a small volume (one 
hundredth to one thousandth of the original volume) of phosphate-buffered saline (PBS), 

25 pH 7.4. The titer of the viral concentrate was approximately lxlO 6 pfu/jal as determined in 

293 human fibroblasts measured by the number of GFP-positive cells. The viral suspension 
was stored frozen at -80°C 

B. Production of transgenic mice and rats 

30 "~ ~ 

The lentivirus was used to produce transgenic mice and rats. 
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1. Superovulation and embrvo collection 

Female mice and rats were superovulated with a combination of pregnant mare's 
serum (PMS) and human chorionic gonadotropin (hCG) as described in Hogan, B., 
Beddington, R, Costantini, F., and Lacy, E. (1994). Manipulating the Mouse Embryo: A 
Laboratory Manual. Cold Spring Harbor Laboratory Press. "Superovulation" refers to 
administering gonadotropins to female mammals prior to mating to increase the number of 
eggs that are ovulated. Prepubescent female mice (approximately 25 days of age and 
weighing between 12.5 and 14 grams) were injected intraperitoneal ly with 5 IU of PMS 
(Sigma G 4527, 25 IU/ml in 0.9% NaCl) between 1 and 3 p.m. on day -2, followed by 5 IU 
of HCG (Sigma C 8554, 25 IU/ml in 0.9% NaCl) 48 hours later on day 0. Prepubescent 
female rats between 28-30 days of age and weighing between 70 and 80 grams were 
injected intraperi tonally with 25 IU of PMS between 1 and 3 p.m. on day -2, followed by 5 
IU of HCG 48 hours later on day 0. For both rats and mice, hormone-treated females were 
caged overnight with fertile males (2-3 months of age) to mate. On the morning of day 1, 
females were checked for copulation plugs. 

Female mice were sacrificed for embryo collection around 10 a.m. on the morning 
of day 1, while female rats were sacrificed for collection around 1 p.m. on the afternoon of 
day 1. Embryos were collected from mice and rats essentially following the procedure 
described in Hogan, B., Beddington, R., Costantini, F., and Lacy, E. (1994) Manipulating 
the Mouse Embryo: A Laboratory Manual. Cold Spring Harbor Laboratory Press. Briefly, 
animals were sacrificed by C0 2 inhalation, and the oviducts were excised and transferred to 
a dish containing M2 medium at room temperature. Newly fertilized embryos, enclosed by 
cumulus mass cells, were released from the swollen ampullae (the upper portion of the 
oviduct) by gently tugging and opening the walls of the ampullae with fine forceps. The 
embryos were then transferred to a dish containing a hyaluronidase solution (Sigma H 
3884, 300 pg/ml in M2 medium), which enzymatically digested the cumulus cells, thus 
releasing the embryos. When the cumulus cells were shed, the embryos were transferred to 
fresh M2 medium to wash off the hyaluronidase solution and preserve the viability of the 
embryos. In rats, the cumulus cells were found to adhere tenaciously to the surface of some 
embryos and were difficult to remove completely. Thus in some cases the subsequent 
experimental manipulations with the zygotes were carried out with some of the cumulus 
cells still adhering. This did not seem to affect the outcome. The embryos were then 
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transferred to microdrops of M16 medium under mineral oil and cultured in a humidified 
37°C incubator under 5% C0 2 until needed. 

2. Delivery of lentiviruses to single-cell embryos 

Lentiviruses were delivered to the fertilized oocytes on the same day of collection, 
targeting only single-cell zygotes to minimize mosaicism. Infection with lentivirus derived 
from the FUGW viral construct will lead to integration of the provirus locus diagrammed in 
Figure IB. Two different methods were used to deliver the lentiviruses to the embryos: 

a. Microinjection of lentiviruses into the perivitelline space of 
single-cell embryos 

Micropipettes were prepared by pulling borosilicate glass capillaries (1 mm O.D., 
0.7 mm LD.) on a Sutter Instruments pipette puller. The tip was cut at an angle to 
approximately 10 jim with a razor blade. The micropipette was then inserted into the 
pipette holder of a CellTram hydraulic injector (Eppendorf). The lentiviral concentrate 
prepared above was pipetted up and down to release any large aggregates of cellular debris. 
The virus was centrifiiged at low speed in a tabletop microfiige (1000 rpm for 1 min.), and 
removed from the top. The viral suspension was then loaded into the micropipette from the 
tip using gentle negative pressure from the CellTram. 

One-cell embryos were transferred to a microdrop of M2 medium on a slide and 
covered with mineral oil to maintain the osmolality. The slide was mounted on the stage of 
an inverted light microscope, and the injection procedure was monitored under 400X 
magnification. Embryos were held in place against a fire-polished pipette using gentle 
negative pressure. The pipette holder with the virus was loaded onto a micromanipulator 
(Leitz). Using the micromanipulator to guide the pipette, the tip was pushed through the 
zona pellucida into the region between the zona pellucida and the oocyte cell membrane. 
Using gentle positive pressure, approximately 10 nanoliters of the viral concentrate was 
delivered into the perivitelline space. The micropipette was then withdrawn from the 
zygote. After the injection, the embryos were sorted and those that were lysed, abnormal, 
or at the 2-cell stage were discarded. The remaining embryos were transferred to Ml 6 
microdrops under oil and cultured in a 37°C incubator under 5% CO2 until implantation. 
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b. Co-incubation of denuded single-cell embryos with lentiviruses 

The zona pellucida of the fertilized oocytes was removed by incubation in either an 
acidic Tyrode's solution (Hogan, B., Beddington, R., Costantini, F., and Lacy, E. (1994). 
Manipulating the Mouse Embryo: A Laboratory Manual Cold Spring Harbor Laboratory 
Press) or a 0.5% pronase solution in M2 medium at 37°C in a humidified 5% C0 2 
incubator for several minutes. When the zonae appeared to be dissolved, embryos were 
washed in excess M2 medium and then transferred into 10 |il microdrops of viral 
suspension under mineral oil. Embryos were cultured individually in separate microdrops 
to prevent them from adhering to one another. The viral suspension was diluted to various 
concentrations to roughly control the average number of proviral integrations expected per 
transgenic genome. Virus in the microdrops was diluted to 2xl0 4 pfu/^il, 400 pfii/^1, and 8 
pfu/jal. Zygotes were incubated in the viral suspension for at least 4-6 hours before 
implantation to allow viral entry into the cell. 

3. Transfer of embryos into recipient females 

Timed pseudopregnant females to host the treated embryos were prepared by mating 
sexually mature females in estrus to vasectomized, mature males the night before the 
intended day of implantation. Appropriate females were selected from a colony of 30-40 
females by taking vaginal smears and examining them for the cell types typical of the estrus 
phase. Males were vasectomized by tying off the vas deferens at two separate locations, 
approximately 5-6 mm apart, then cauterizing the intervening segment to sever the tube. 
Males were vasectomized at least 2 weeks prior to the mating to ensure that all remaining 
sperm in the genital tract were dead at the time of mating. 

Embryos infected with lentivirus were transferred into host females as soon as 
possible to achieve maximum rates of implantation. Early-stage embryos (0-2.5 days p.c.) 
with an intact zona pellucida were transferred to the oviduct of timed pseudopregnant 
females (0.5 days p.c), while embryos that had reached the morula or blastocyst stage were 
transferred to the uterus of timed pseudopregnant females (2.5 days p.c). In general, no 
more than 30 embryos were transferred bilaterally into the uterus. These procedures were 
carried out essentially as described in (Hogan, B., Beddington, R., Costantini, F., and Lacy, 
E. (1994). Manipulating the Mouse Embryo: A Laboratory Manual Cold Spring Harbor 
Laboratory Press). Pregnancy and delivery of the transgenic litter was as usual. 
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C. Analysis of transgenic animals 

Animals in the resulting litters were analyzed for the presence of the transgene and 
the number of insertions of the transgene by standard Southern blot analysis (Sambrook, J., 
5 Fritsch, E.F., Maniatis, T. (1989). Molecular Cloning: A Laboratory Manuel Cold Spring 

Harbor laboratory Press.), cutting with PstI or BamHI and hybridizing against a GFP+WRE 
probe. For constitutive promoters, expression of GFP was determined by directly viewing 
the skin of the animals under a conventional epi fluorescence microscope. Some transgenic 
animals that were scored as negative for expression were actually expressing the transgene 

10 at levels below that of detection by visual inspection with a fluorescent microscope. In 

such cases, western blot analysis revelaed that animals in which GFP fluorescence was not 
detected by visual inspection did express the GFP protein in some tissues. Similarly, 
immunocytochemistry proved to be a more sensitive assay for determining expression. For 
the tissue-specific promoters, some proportion of the transgenic litter was sacrificed during 

15 development at embryonic stages, and the translucent embryos were checked for spatially 

regulated GFP expression under a fluorescent microscope. Expression results were 
confirmed by histology. To test the ability of the founder animals to transmit the transgene 
to their progeny, animals positive by Southern analysis were outcrossed to wild-type 
animals, and their progeny scored for transgenesis and expression as described above. 

20 

D. Results 

In one set of experiments one-cell mouse and rat embryos were injected in the 
perivitelline space with recombinant lentivirus as described above. In the first experiment, 
1 7 founder mice developed to term from 78 implanted embryos. Of these, 1 1 of the 1 7 

25 founders expressed the transgene as determined by directly viewing the animals under an 

epifluoresence microscope. Further, 11 of 15 (two mice died prior to analysis), or 
approximately 73%, were found to carry the transgene by Southern blot analysis. The 
average number of insertions in the transgenic mice was 6.1. Several of the animals carried 
as few as 2 insertions. These results are presented in Figure 2. 

30 In a second experiment, 56 founder mice developed to term from 119 implanted 

embryos. Of these 45, or about 80%, were found to express the transgene. Thus, in the two 
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experiments 58 out of 73 founder mice, approximately 79.5%, carried the transgene. Figure 
3 shows the Southern blot analysis of proviral transgene insertions in these founder mice. 

All GFP-positive mice carried an integrated provirus, and all animals with two or 
more copies of the provirus expressed the transgene at levels detectable by direct viewing 
of GFP fluorescence. The intensity of GFP fluorescence correlated positively with copy 
number, as estimated qualitatively. All major tissues and organs, including skin, bone, 
skeletal muscle, cardiac muscle, lung, liver, thymus, spleen, stomach, intestine, kidney, 
brain, retina and gonads were GFP positive. 

In a third experiment, five rats developed to term from embryos injected with 
lentivirus created from the FUGW construct. Two of the five rats were found to express 
the transgene as determined by brightfield and fluorescent images of the paws of the 
newborn rats (Figure 4A). Pup R4 expresses GFP in the paw, as well as in all other tissues 
and organs that were examined (Fig. 4A). Figure 4B shows the Southern blot analysis of 
the proviral insertions in these founder rats and indicates that pup R4 carries 4 copies of the 
proviral insert. 

In a continuation of this experiment, out of 22 founder rat pups bom from 130 
implanted embryos, 13 (59.1%) carried one or more proviral insertions as determined by 
Southern blot analysis and 9 (40.9%) expressed GFP at levels detectable by directly 
viewing the skin under a fluorescent microscope. GFP positive founders were crossed to 
wild-type animals, and Fl progeny rats carrying as few as one copy of the provirus 
expressed GFP, as determined by direct viewing with a fluorescent microscope, indicating 
that the GFP-expressing transgene is not silenced by transmission through the germline. 

In another set of experiments denuded mouse embryos were incubated in decreasing 
concentrations of recombinant lentivirus. A rough correlation was seen between the titer of 
virus in which embryos were incubated and the number of proviral insertions. At a 1 :50 
dilution from a stock of 1 x 10 6 pfu/jil 5 founder mice that reached term (from 29 implanted 
embryos) were found to be transgenic. All of these animals carried at least 6 proviral 
insertions. The average number of insertions was 7.2. Of these, 4, or 80%, were found to 
express the transgene. At a 1:250 dilution five out of 7 founder mice that reached term 
(from 1 8 implanted embryos) were found to be transgenic and express the transgene. In 
these mice the average number of insertions was 3.8, with two of the animals carrying only 
one or two copies of the transgene. Finally, at a dilution of 1:1250 only one of the 8 
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founders (from 40 implanted embryos) was found to be transgenic, with a single insertion. 
This founder also expressed the transgene. Figure 5 shows the Southern blot analysis of 
proviral transgene insertions in these founder mice. Figure 6 shows GFP expression in one 
of the founder mice. A second trial with a 1 :250 dilution gave comparable results. Eight of 
1 1 founder mice (from 59 implanted embryos) were transgenic, with seven expressing the 
transgene. The transgenic mice had an average of 2.6 insertions. 

Following outcrossing to wild-type animals, progeny were analyzed for proviral 
transgene insertions by Southern blot (Figure 7) and for GFP expression by viewing under 
an epifluoresence microscope (Figure 8). As can be seen in Table 1 , founder mice were 
able to transmit the transgene to their progeny. In Table 1, "PV" represents founder 
transgenics generated by injection of the lentivirus into the perivitelline space of one-cell 
embryos while "Co-inc" represents founder transgenics generated by co-incubation of the 
denuded embryos with lentivirus. 



Table 1 



Founder 


No. 
insertions in 
founder 


No. progeny 


No. 
expressing 


PV.13 


6 


7 


6/7 


I PV.2 


2 


4 


2/4 


PV.10 


2 


7 


1/7 


PV.17 


10 


10 


10/10 


Co-inc.18 


12 


12 


9/12 


Co-inc.2 


0 


9 


0/9 



Ubiquitous GFP expression similar to that of the founder animals was observed in 
transgenic Fl progeny, indicating that the pro virus was not inactivated through one round 
of gametogenesis and development. All animals carrying two or more insertions of the 
FUGW provirus expressed GFP at levels detectable by direct fluorescence. However, 
among transgenic lines carrying one proviral insertion, approximately half expressed the 
transgene at levels detectable by direct fluorescence. In one single insertion line in which 
GFP expression was not observed by direct viewing, GFP was detectable by Western blot 
analysis in some tissues (brain, testes) but not in others (heart, lung, liver, kidney, spleen). 
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In a further experiment, single-cell mouse zygotes were injected in the perivitelline 
space with recombinant lentivirus derived from the FMH2BGW viral construct described 
above. This construct comprises a histone2B-GFP fusion gene under the control of the 
myogenin promoter. The histone2B-GFP reporter was used to concentrate the fluorescence 
5 in the nuclei, making the signal more intense. Zygotes were implanted in pseudopregnant 

female mice and then recovered from the uterus at embryonic day 1 L5. As can be seen in 
Figure 9, all six founder-mice were transgenic as determined by Southern blot analysis of 
proviral transgene insertions. Of these, two animals were positive for tissue-specific GFP 
expression at embryonic day 1 1.5 (Figure 9). 

10 Figure 10 shows that at embryonic day 11.5, GFP expression is localized to the 

somites and can be seen in the emerging muscles in the limb buds, eye and jaw. This 
expression pattern is consistent with myogenin expression at this stage of development. 
Figure 1 1 shows the results of immunofluoresence studies of sections of an embryonic day 
11.5 embryo carrying the myogenin promoter driving GFP. Specific staining of somite 

15 tissues can be seen (Fig. 11). Figures 12 and 13 show further immunofluorescence studies 

of cross-sections of an El 1 .5-embryo carrying a myogenin promoter driving GFP. 

Fifteen-day old animals, derived from FMH2BGW-infected zygotes showed GFP 
fluorescence in the nuclei of skeletal muscle in the tongue, limbs, chest and jaw, but not in 
cardiac or smooth muscle or other non-muscle tissues examined, reflecting the known 

20 specificity of myogenin expression. Fl progeny from three independent founders expressed 

histone2B-GFP exclusively in the skeletal muscle lineage. Furthermore, progeny carrying 
as few as one FMH2BGW proviral insertion expressed histone2B-GFP in the appropriate 
tissue types at high levels detectable by direct viewing with a fluorescent microscope. 

In a further experiment, a viral vector containing GFP driven by the T-lymphocyte 

25 promoter lck y F/cAGW was delivered to the perivitelline space by injection as described 

above. The resulting transgenic mice expressed GFP exclusively in the thymus. 

Example 2 

30 Transgenic birds, such as chicken or quail, may be made by the methods of the 

present invention. 
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Freshly laid chicken eggs (day 0) are placed in a temperature-controlled, humidified 
incubator at 38°C. The embryonic blastodisc is gradually rotated to lie on top of the yolk 
by gently rocking the eggs in the incubator every 1 5 minutes. A window is opened in the 
shell and the blastodisc is visualized in freshly laid eggs (0 hours post-laying) or stage X 
5 embryos (36 hours post-laying). VSV-pseudotyped lentiviral particles in solution are 

loaded into a glass capillary micropipette. To maximize the chances of targeting primordial 
germ cells, virus is injected in the anterior regions of the O hour embryos and in the 
gonadal anlage of the 36 hours embryos. Approximately 200 nL of viral solution are 
delivered into the space between the perivitelline membrane and the embryonic disk with 

10 the aid of a hydraulic injector. The shell window is then closed with a porous tape to allow 

gas exchange between the embryo and the incubator atmosphere. The embryos are then 
incubated without rocking. The eggs will hatch after approximately 20 days of incubation 
time. Hatched chicks are raised to sexual maturity and then mated. The eggs laid by the 
mated females are raised to hatching and the resulting transgenic chicks are identified, such 

15 as by Southern blot, PCR or expression analysis. 

Example 3 

Transgenic zebra finch were made by the methods of the present invention. 

20 Freshly laid zebra finch eggs (day 0) were placed in a temperature-controlled, 

humidified incubator at 38°C. The embryonic blastodisc was gradually rotated to lie on top 
of the yolk by gently rocking the eggs in the incubator every 15 minutes. A window was 
opened in the shell and the blastodisc was visualized. VSV-pseudotyped lentiviral particles 
in solution were loaded into a glass capillary micropipette. The lentivirus was derived from 

25 the FUH2BGW viral construct described above. To maximize the chances of targeting 

primordial germ cells, virus was injected in the anterior regions of 0 hour embryos and in 
the gonadal anlage of 36 hour embryos. Approximately 200 nL of viral solution are 
delivered into the space between the perivitelline membrane and the embryonic disk with 
the aid of a hydraulic injector. 

30 The shell window was closed with a porous tape to allow gas exchange between the 

embryo and the incubator atmosphere. The embryos were then incubated without rocking. 
Figure 15 shows H2B-GFP expression in the extraembryonic tissue. Figure 16 shows H2B- 
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GFP expression inside of the zebra finch embryo, indicating that primordial genn cells 
carried and expressed the transgene. 

The eggs will hatch after approximately 20 days of incubation time. Hatched 
chimeric chicks are raised to sexual maturity and then mated. The eggs laid by the mated 
females are raised to hatching and the resulting transgenic chicks are identified, such as by 
Southern blot, PCR or expression analysis. 

Example 4 

Transgenic fish may be made by the methods of the present invention. Breeding 
pairs offish are. placed in a water tank with a grooved bottom, where fertilized eggs are 
deposited. Fertilized eggs (zygotes) are collected and held in embryo medium on ice. 
Zygotes are aligned in grooves formed in a slab of agarose. A modified lentivirus, as 
described above, is loaded into a glass capillary micropipette. The chorion membrane 
surrounding the zygote is pierced with the glass micropipette and 200 nL of viral solution 
are delivered into the space between the zygotic membrane and the chorion. Injected 
zygotes are returned to a temperature-controlled water tank and allowed to mature. At 
sexual maturity, the founder fish are mated and their progeny analyzed for the presence of 
the transgene, such as by Southern blot, PCR and protein analysis. 

Example S 

The modified lentivirus described above may also be used in gene trap experiments 
such as in zebrafish. As discussed above, this technique allows the identification and 
cloning of a gene that is expressed in a particular tissue or cell type and/or at a particular 
tune based solely on its pattern of expression. Zebrafish is an ideal system for gene 
trapping for several reasons. First, embryonic development occurs externally, allowing for 
easy manipulation and viewing of the embryos. Furthermore, early stage zebrafish embryos 
are translucent, and the pigmentation can be further suppressed for several more days by 
incubating the embryo in a 0.003% solution of l-phenyl-2-thiourea (PTU). The translucent 
property of zebrafish embryos facilitates the viewing of a live fluorescent reporter to 
identify trapped genes expressed in spatial or temporal patterns of interest. 
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Self-inactivating lentiviral vectors are engineered to contain a gene trap element 
consisting of the following sequences: splice acceptor-IRES-GFP-poly A addition signal. 
This cassette is called SAIGP. The SAIGP element is inserted in a 3' to 5' orientation with 
respect to the viral LTR sequences, to prevent inappropriate splicing or termination of the 
5 viral genome during packaging. Zebrafish zygotes are injected with VSVg-pseudotyped, 

concentrated SAIGP lentivirus as described above. Fish are raised to sexual maturity and 
mated. The progeny are viewed with a fluorescent microscope, and GFP-expressing 
individuals are separated for further analysis. GFP-positive animals are then analyzed with 
a confocal fluorescent microscope to determine the spatial and temporal pattern of 

10 expression. Messenger RNA is extracted from those tissues of the animal that express GFP 

in the time and place of interest, and reverse transcription with oligonucleotides 
complementary to GFP yields a cDNA that should contain the sequences of the trapped 
gene that flank the provirus. The recovered cDNA is subcloned into an appropriate 
bacterial plasmid, and the gene that has been trapped by the SAIGFP provirus is identified 

15 by sequencing the upstream regions of the cDNA. 

Example 6 

Virus particles generated from the FUGW vector were generated as described 
above. The virus particles were injected using the peri vitelline injection method, also 

20 described above, into 4 fertilized rhesus monkey (Macaca mulatta) single cell embryos. 

Monkey oocytes can be fertilized directly with sperm or can be fertilized using the 
intracytoplasmic sperm injection (ICSI) method. Of the four embryos injected, 2 developed 
into blastocysts. Both blastocysts were green, evidencing expression of GFP. In the 
injected blastocysts, cells in the trophectoder (TE) and the inner cell mass (ICM) both were 

25 green. Non-injected control embryos were not green. Transformed embryos are transferred 

to host mothers for gestation. After approximately 150 to 175 days, a newborn rhesus 
monkey is delivered which expresses GFP throughout. Confirmation of the presence of the 
transgene and expression in various tissues is carried out as described above. 
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WHAT IS CLAIMED IS : 

1 . A method of producing a transgenic bird comprising: 
transfecting a packaging cell line with a retroviral construct; 

recovering recombinant retroviral particles from the packaging cell line; and 
infecting a bird egg with the recombinant retroviral particles, wherein the 

retroviral construct comprises the R and U5 sequences from a 5' lenti viral LTR and 

a self-inactivating 3' lentiviral LTR. 

2. The method of claim 1 wherein infecting a bird egg comprises contacting the 
embryonic blastodisc of the bird egg with the retroviral particles. 

3. The method of claim 1 wherein said retroviral construct additionally 
comprises an internal promoter. 

4. The method of claim 1 wherein said retroviral construct additionally 
comprises a gene of interest. 

5. The method of claim 4 wherein said transgenic animal expresses the gene of 
interest. 

6. The method of claim 1 wherein said packaging cell line is a 293 cell line. 

7. The method of claim 1 wherein the 5' LTR sequences are from HIV. 

8. The method of claim 1 wherein the self-inactivating 3* LTR comprises a U3 
element with a deletion of its enhancer sequence. 

9. The method of claim 8 wherein the self-inactivating 3' LTR is a modified 
HIV 3' LTR. 

1 0. The method of claim 1 wherein the recombinant retrovirus is pseudotyped. 

11. The method of claim 10 wherein the recombinant retrovirus is pseudotyped 
with the vesicular stomatitits virus envelope glycoprotein. 

12. The method of claim 1 wherein the viral construct additionally comprises a 
promoter operably linked to the R and U5 5' LTR sequences. 

1 3. The method of claim 12 wherein the promoter is a CMV promoter. 

14. The method of claim 12 wherein the viral construct additionally comprises 
an enhancer operably linked to the promoter. 

15. The method of claim 14 wherein the enhancer and promoter are CMV 
sequences. 
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16. The method of claim 1 wherein the viral construct additionally comprises 
the woodchuck hepatitis virus enhancer element sequence. 

17. The method of claim 1 wherein the viral construct additionally comprises a 
tRNA amber suppressor sequence. 

5 18. The method of claim 3 wherein the viral construct additionally comprises a 

reporter gene operably linked to the internal promoter. 

19. The method of claim 18 wherein the reporter gene encodes a fluorescent 

protein. 

20. The method of claim 19 wherein said fluorescent protein is green fluorescent 

10 protein. 

21. The method of claim 3 wherein the internal promoter is a ubiquitous 
promoter. 

22. The method of claim 21 wherein said ubiquitous promoter is selected from 
the group consisting of the ubiquitin promoter, the CMV p-actin promoter and the pgk 

1 5 promoter. 

23. The method of claim 3 wherein the internal promoter is a tissue specific 
promoter. 

24. The method of claim 23 wherein said tissue specific promoter is selected 
from the group consisting of the lck promoter, the myogenin promoter and the thyl 

20 promoter. 

25. A transgenic bird made by the method of claim 1 . 

26. A method of producing a transgenic bird comprising the following steps: 

a) opening a window in the shell of a fertilized bird egg; 

b) injecting modified retrovirus into the space between the perivitelline 
25 membrane and the embryonic blastodisc; and 

c) incubating the embryo until hatching. 

27. The method of claim 26 wherein the modified retrovirus is a modified 
lentivirus. 

30 28. The method of claim 27 wherein the modified lentivirus is produced by 

transfecting a packaging cell line with a viral construct. 
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29. The method of claim 28 wherein the viral construct comprises the R and U5 
sequences from a lentiviral 5' LTR, an internal promoter, a gene of interest, and a self- 
inactivating lentiviral 3' LTR. 

30. A transgenic bird wherein one or more germ cells comprises proviral DNA, 
said proviral DNA comprising a self-inactivating 3' lentiviral LTR. 

31. The transgenic bird of claim 30 wherein the self-inactivating 3' LTR is a 
self-inactivating 3' HTV LTR. 

32. The transgenic bird of claim 30 wherein the self-inactivating 3' HIV LTR 
comprises a U3 element with a deletion of its enhancer sequence. 

33. A method of producing a transgenic fish comprising: 
transfecting a packaging cell line with a viral construct; 

recovering recombinant retroviral particles from the packaging cell line; and 
infecting a fish egg with the recombinant retroviral particles, wherein the 

viral construct comprises the R and U5 sequences from a 5' lentiviral LTR and a 

self-inactivating 3' lentiviral LTR. 

34. The method of claim 33 wherein infecting a fish egg comprises delivering 
the retroviral particles to the space between the chorion and the cell membrane of the fish 
egg- 

35. A transgenic fish made by the method of claim 33. 

36. The method of claim 33 wherein said retroviral construct additionally 
comprises an internal promoter. 

37. The method of claim 33 wherein said retroviral construct additionally 
comprises a gene of interest. 

38. The method of claim 37 wherein said transgenic animal expresses the gene 
of interest. 

39. A transgenic fish whose genome comprises proviral DNA, said proviral 
DNA comprising a self-inactivating 3 ' lentiviral LTR. 

40. The transgenic fish of claim 39 wherein the self-inactivating 3' LTR is a 
self-inactivating 3 ' HIV LTR. 

41. The transgenic fish of claim 39 wherein the self-inactivating 3' HIV LTR 
comprises a U3 element with a deletion of its enhancer sequence. 
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SEQUENCE LISTING 

<110> CALIFORNIA INSTITUTE OF TECHNOLOGY 
BALTIMORE, DAVID 
HONG , ELIZABETH J. 
LO IS- CABALLE , CARLOS 
PEASE, SHIRLEY 

<120> METHOD FOR PRODUCING TRANSGENIC BIRDS AND FISH 



<130> CALTE . 0 0 9 VPC 

<150> 60/322,031 
<151> 2001-09-13 

<150> 60/347,782 
<151> 2002-01-09 

<160> 8 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 178 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 1 

acaaatggca gtattcatcc acaattttaa aagaaaaggg gggattgggg ggtacagtgc 60 
aggggaaaga atagtagaca taatagcaac agacatacaa actaaagaat tacaaaaaca 120 
aattacaaaa attcaaaatt ttcgggttta ttacagggac agcagagatc cagtttgg 178 

<210> 2 
<211> 1221 
<212> DNA 

<213> Homo sapiens 
<400> 2 

gggtgcagcg gcctccgcgc cgggttttgg cgcctcccgc gggcgccccc ctcctcacgg 60 
cgagcgctgc cacgtcagac gaagggcgca ggagcgttcc tgatccttcc gcccggacgc 120 
tcaggacagc ggcccgctgc tcataagact cggccttaga accccagtat cagcagaagg 180 
acattttagg acgggacttg ggtgactcta gggcactggt tttctttcca gagagcggaa 240 
caggcgagga aaagtagtcc cttctcggcg attctgcgga gggatctccg tggggcggtg 3 00 
aacgccgatg attatataag gacgcgccgg gtgtggcaca gctagttccg tcgcagccgg 360 
gatttgggtc gcggttcttg tttgtggatc gctgtgatcg tcacttggtg agttgcgggc 420 
tgctgggctg gccggggctt tcgtggccgc cgggccgctc ggtgggacgg aagcgtgtgg 4 80 
agagaccgcc aagggctgta gtctgggtcc gcgagcaagg ttgccctgaa ctgggggttg 540 
gggggagcgc acaaaatggc ggctgttccc gagtcttgaa tggaagacgc ttgtaaggcg 600 
ggctgtgagg tcgttgaaac aaggtggggg gcatggtggg cggcaagaac ccaaggtctt 660 
gaggccttcg ctaatgcggg aaagctctta ttcgggtgag atgggctggg gcaccatctg 720 
gggaccctga cgtgaagttt gtcactgact ggagaactcg ggtttgtcgt ctggttgcgg 780 
gggcggcagt tatgcggtgc cgttgggcag tgcacccgta cctttgggag cgcgcgcctc 84 0 
gtcgtgtcgt gacgtcaccc gttctgttgg cttataatgc agggtggggc cacctgccgg 900 
taggtgtgcg gtaggctttt ctccgtcgca ggacgcaggg ttcgggccta gggtaggctc 960 
tcctgaatcg acaggcgccg gacctctggt gaggggaggg ataagtgagg cgtcagtttc 1020 
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tttggtcggt tttatgtacc tatcttctta agtagctgaa gctccggttt tgaactatgc 1080 
gctcggggtt ggcgagtgtg ttttgtgaag ttttttaggc accttttgaa atgtaatcat 1140 
ttgggtcaat atgtaatttt cagtgttaga ctagtaaatt gtccgctaaa ttctggccgt 1200 
ttttggcttt tttgttagac a 1221 

<210> 3 
<211> 720 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> This sequence encodes for a green fluorescent 
protein variant. 



<400> 3 

atggtgagca agggcgagga gctgttcacc ggggtggtgc ccatcctggt cgagctggac 60 
ggcgacgtga acggccacaa gttcagcgtg tccggcgagg gcgagggcga tgccacctac 120 
ggcaagctga ccctgaagtt catctgcacc accggcaagc tgcccgtgcc ctggcccacc 180 
ctcgtgacca ccttcaccta cggcgtgcag tgcttcagcc gctaccccga ccacatgaag 24 0 
cagcacgact tcttcaagtc cgccatgccc gaaggctacg tccaggagcg caccatcttc 300 
ttcaaggacg acggcaacta caagacccgc gccgaggtga agttcgaggg cgacaccctg 360 
gtgaaccgca tcgagctgaa gggcatcgac ttcaaggagg acggcaacat cctggggcac 42 0 
aagctggagt acaactacaa cagccacaac gtctatatca tggccgacaa gcagaagaac 480 
ggcatcaagg tgaacttcaa gatccgccac aacatcgagg acggcagcgt gcagctcgcc 540 
gaccactacc agcagaacac ccccatcggc gacggccccg tgctgctgcc cgacaaccac 600 
tacctgagca cccagtccgc cctgagcaaa gaccccaacg agaagcgcga tcacatggtc 660 
ctgctggagt tcgtgaccgc cgccgggatc actcacggca tggacgagct gtacaagtaa 72 0 



<210> 4 
<211> 604 
<212> DNA 

<213> Woodchuck hepatitis virus 



<400> 4 

atcgataatc aacctctgga ttacaaaatt tgtgaaagat tgactggtat tcttaactat 60 
gttgctcctt ttacgctatg tggatacgct gctttaatgc ctttgtatca tgctattgct 120 
tcccgtatgg ctttcatttt ctcctccttg tataaatcct ggttgctgtc tctttatgag 180 
gagttgtggc ccgttgtcag gcaacgtggc gtggtgtgca ctgtgtttgc tgacgcaacc 24 0 
cccactggtt ggggcattgc caccacctgt cagctccttt ccgggacttt cgctttcccc 300 
ctccctattg ccacggcgga actcatcgcc gcctgccttg cccgctgctg gacaggggct 360 
cggctgttgg gcactgacaa ttccgtggtg ttgtcgggga agctgacgtc ctttccatgg 42 0 
ctgctcgcct gtgttgccac ctggattctg cgcgggacgt ccttctgcta cgtcccttcg 480 
gccctcaatc cagcggacct tccttcccgc ggcctgctgc cggctctgcg gcctcttccg 54 0 
cgtcttcgcc ttcgccctca gacgagtcgg atctcccttt gggccgcctc cccgcctgat 600 
cgat " ~ ~ 604 

<210> 5 
<211> 1119 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> This sequence encodes for a fusion protein between 
human histone 2B protein and a green fluorescent 
protein variant. 
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<400> 5 

accatgccag 

actaaggcgc 

atctatgtgt 

atgggcatca 

cgcctggcgc 

cgcctgctgc 

accaagtaca 

ctgttcaccg 

ttcagcgtgt 

atctgcacca 

ggcgtgcagt 

gccatgcccg 

aagacccgcg 

ggcatcgact 

agccacaacg 

atccgccaca 

cccatcggcg 

ctgagcaaag 

gccgggatca 



agccagcgaa 
agaagaaagg 
acaaggttct 
tgaattcgtt 
attacaacaa 
tgcctgggga 
ccagcgctaa 
gggtggtgcc 
ccggcgaggg 
ccggcaagct 
gcttcagccg 
aaggctacgt 
ccgaggtgaa 
tcaaggagga 
tctatatcat 
acatcgagga 
acggccccgt 
accccaacga 
ctctcggcat 



gtctgctccc 
cggcaagaag 
gaagcaggtc 
tgtgaacgac 
gcgctcgacc 
gttggccaag 
ggatccaccg 
catcctggtc 
cgagggcgat 
gcccgtgccc 
ctaccccgac 
ccaggagcgc 
gttcgagggc 
cggcaacatc 
ggccgacaag 
cggcagcgtg 
gctgctgccc 
gaagcgcgat 
ggacgagctg 



gccccgaaaa 
cgcaagcgca 
caccctgaca 
attttcgagc 
atcacctcca 
cacgccgtgt 
gtcgccacca 
gagctggacg 
gccacctacg 
tggcccaccc 
cacatgaagc 
accatcttct 
gacaccctgg 
ctggggcaca 
cagaagaacg 
cagctcgccg 
gacaaccact 
cacatggtcc 
tacaagtaa 



agggctccaa 
gccgcaagga 
ccggcatttc 
gcatcgcagg 
gggagatcca 
ccgagggtac 
tggtgagcaa 
gcgacgtaaa 
gcaagctgac 
tcgtgaccac 
agcacgactt 
tcaaggacga 
tgaaccgcat 
agctggagta 
gcatcaaggt 
accactacca 
acctgagcac 
tgctggagtt 



gaaggcggtg 
gagctattcc 
gtccaaggcc 
tgaggcttcc 
gacggccgtg 
taaggccatc 
gggcgaggag 
cggccacaag 
cctgaagttc 
cctgacctac 
cttcaagtcc 
cggcaactac 
cgagctgaag 
caactacaac 
gaacttcaag 
gcagaacacc 
ccagtccgcc 
cgtgaccgcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1119 



<210> 6 
<211> 1276 
<212> DNA 
<213> Murine 



<400> 6 

tgagtcacca 

ctttttgttt 

tgtagccgtg 

ttgcaaaata 

tcgtgaaccc 

ccagcctgct 

ttcagatctc 

ataaaataaa 

ggaaggaggc 

agcaactcta 

tttccacagc 

acaggtttct 

ctgtggttga 

aaggagctgg 

ggtaccctcc 

aagtgtgtgt 

gcaggaagtg 

gtgggagggt 

agagggaacc 

ggggcctctg 

tgtctgatgt 

agggatcatg 



tgtgattgct 
gtttgtttgt 
tgtgtggggt 
gattaaacaa 
ctttattaaa 
ctacagaatg 
aaggccagcc 
ataaaagtta 
acacgggaat 
ctttactggc 
taaaccaggc 
ctctgctgct 
gtggtggggg 
aacctctcag 
ttggtggagg 
gatgtctccc 
ggtaactaga 
gggactagcc 
cagtcaggag 
agctgacgat 
tggggcgagt 
ggctgt 



gggaattgaa 
ttgtttgttt 
agccctgtgt 
ctgagagatg 
tctaacactc 
aatttcaggt 
aagactatgc 
atcttcgctt 
tccagaggct 
tgtgtctatg 
aagggagccg 
gagaagcaaa 
taggggtgct 
cttcggtggc 
agggtggaat 
aggtagtccc 
ctaacaaaga 
ttgggcctgg 
cttgaatccc 
ctcgggtact 
ggcttagggc 



ctcaagacct 
gggttttttt 
gtgagtgtgt 
gaataggtct 
agagacagga 
ctcaaggtca 
agtaagacct 
ggcaaaccga 
acagagggag 
aggttctgct 
aagtagacac 
agcctgtttg 
ggggttgggc 
taggcagggg 
gaaactctcg 
ccaaaggagg 
tgcctgcctg 
gcctcctgtg 
acgattcagc 
ttttgtaact 
cagctccttc 



ctggaagagc 
tttgagacag 
gtgtgtgtgt 
tcttgacatc 
gcatctctgc 
gcttggtcta 
tagctaaaat 
taattgagga 
cctcgctctg 
tgatttcatt 
agccacccgg 
aagaaactct 
tgaggctgag 
agttgtaatg 
gtttccccca 
aggctagcag 
tggcggtttg 
aacttggtgc 
gcttctgtct 
tccagaacag 
aggcctctct 



agtcagtgct 
ggtttctctg 
ttgtggtatg 
aaaaacatga 
aggtttgagg 
caaagtgagt 
aaataaataa 
ccagtgctca 
acctggttag 
tgacaaaaag 
gccgcgccca 
ctgaaggaga 
ggttgactct 
aagagggaca 
gaacttggca 
agctggggag 
cccatcccag 
ttgagggctc 
gcggccaatg 
ggctctagga 
acattccttc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1276 



<210> 7 
<211> 1140 
<212> DNA 
<213> Murine 



<400> 7 

gtctctagct gcatatgtag cagaagatgg cctagtcggc catcattggg aagagaggcc 60 
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ccttggtatt gcaaactata tgccccagta caggggaacg ccagggccaa gaagtgggaa 120 
tgagtgggta ggggagcagg gcggggggag ggggggttag ggaacttttg ggatagcatt 180 
tgaaatgtaa atgaagaaaa tatctaataa aaaataattt aaaaaagagc gtcagacagg 240 
ggactgaaca gctcttgact aggggagaag aaggcaatgt agagtagtct gtgagttcta 300 
atccttgcta aacactgact tcacctgacc cctactactt aaggcccccc cccttactta 360 
agaagtccct gtgttctctt acttcaatct acccccaaca tcatgagacc tggtcaaaga 420 
agctgtagaa acccaaaagt tgaatccatt tgcccttctg ggtttctgtc tttgcctcca 480 
tggacgatag ggacacacac acacacacac acacacacac acacacacac acgccccaaa 540 
tctggagtgg tcctgatgtg gtagtggtag gtctttaggg gtctcatggg actgacatag 600 
tatggtttaa ggtgctgctg agcaggaaag agaaggctaa gtggattttc aagacccctt 660 
cccgtccgtc caagacaacc cctttcttgt tcccttcctg ccctgtccac cagctgcctt 720 
ggaccatgga ggagagagta ggcaggaggc ccgggtagga gtaattgaaa ggagcagatg 780 
agacggggga atgcacccac ccccaccttc cctgccccac aggggctgtg gagaaatgaa 840 
aactaatcaa attacagccg acggcctccc gacccgtgca caggagccgc ctgggccagg 900 
ggcaggcctg cagggtgggg tgggggcaaa aggagaggga aggggaatca catgtaacca 960 
ctggaaacgt cttgatgtgc agcaacagct tagagggggg ctcaggtttc tgtggcgttg 102 0 
gctatattta tctctgggtt catgccagca gggagggttt aaatggcacc cagcagttgg 1080 
tgtgaggggc tgcgggagct tgggggccag tggcaggaac aagccttttg cgacctgatg 114 0 



<210> 8 
<211> 2853 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> This sequence represents a construct comprising a 
human immunodeficiency virus sequence, a green 
fluorescent protein variant sequence, a human 
ubiquitin promoter sequence and a woodchuck 
hepatitis regulator element sequence. 

<400> 8 

ctgcagacaa atggcagtat tcatccacaa ttttaaaaga aaagggggga ttggggggta 60 
cagtgcaggg gaaagaatag tagacataat agcaacagac atacaaacta aagaattaca 12 0 
aaaacaaatt acaaaaattc aaaattttcg ggtttattac agggacagca gagatccagt 180 
ttggctgcag ttaattaaag atctgggtgc agcggcctcc gcgccgggtt ttggcgcctc 24 0 
ccgcgggcgc ccccctcctc acggcgagcg ctgccacgtc agacgaaggg cgcaggagcg 300 
ttcctgatcc ttccgcccgg acgctcagga cagcggcccg ctgctcataa gactcggcct 360 
tagaacccca gtatcagcag aaggacattt taggacggga cttgggtgac tctagggcac 420 
tggttttctt tccagagagc ggaacaggcg aggaaaagta gtcccttctc ggcgattctg 480 
cggagggatc tccgtggggc ggtgaacgcc gatgattata taaggacgcg ccgggtgtgg 54 0 
cacagctagt tccgtcgcag ccgggatttg ggtcgcggtt cttgtttgtg gatcgctgtg 600 
atcgtcactt ggtgagttgc gggctgctgg gctggccggg gctttcgtgg ccgccgggcc 660 
gctcggtggg acggaagcgt gtggagagac cgccaagggc tgtagtctgg gtccgcgagc 720 
aaggttgccc tgaactgggg gttgggggga gcgcacaaaa tggcggctgt tcccgagtct 780 
tgaatggaag acgcttgtaa ggcgggctgt gaggtcgttg aaacaaggtg gggggcatgg 84 0 
tgggcggcaa gaacccaagg tcttgaggcc ttcgctaatg cgggaaagct cttattcggg 900 
tgagatgggc tggggcacca tctggggacc ctgacgtgaa gtttgtcact gactggagaa 960 
ctcgggtttg tcgtctggtt gcgggggcgg cagttatgcg gtgccgttgg gcagtgcacc 1020 
cgtacctttg ggagcgcgcg cctcgtcgtg tcgtgacgtc acccgttctg ttggcttata 1080 
atgcagggtg gggccacctg ccggtaggtg tgcggtaggc ttttctccgt cgcaggacgc 1140 
agggttcggg cctagggtag gctctcctga atcgacaggc gccggacctc tggtgagggg 1200 
agggataagt gaggcgtcag tttctttggt cggttttatg tacctatctt cttaagtagc 1260 
tgaagctccg gttttgaact atgcgctcgg ggttggcgag tgtgttttgt gaagtttttt 1320 
aggcaccttt tgaaatgtaa tcatttgggt caatatgtaa ttttcagtgt tagactagta 1380 
aattgtccgc taaattctgg ccgtttttgg cttttttgtt agacaaagct tctgcaggtc 144 0 
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gactctagag 
gtggtgccca 
ggcgagggcg 
ggcaagctgc 
ttcagccgct 
ggctacgtcc 
gaggtgaagt 
aaggaggacg 
tatatcatgg 
atcgaggacg 
ggccccgtgc 
cccaacgaga 
cacggcatgg 
tcgatatcga 
actatgttgc 
ttgcttcccg 
atgaggagtt 
caacccccac 
tccccctccc 
gggctcggct 
catggctgct 
cttcggccct 
ttccgcgtct 
ctgatcgata 



gatcccccgg 
tcctggtcga 
agggcgatgc 
ccgtgccctg 
accccgacca 
aggagcgcac 
tcgagggcga 
gcaacatcct 
ccgacaagca 
gcagcgtgca 
tgctgcccga 
agcgcgatca 
acgagctgta 
taatcaacct 
tccttttacg 
tatggctttc 
gtggcccgtt 
tggttggggc 
tattgccacg 
gttgggcact 
cgcctgtgtt 
caatccagcg 
tcgccttcgc 
tcgatgtcga 



gggtaccatg 
gctggacggc 
cacctacggc 
gcccaccctc 
catgaagcag 
catcttcttc 
caccctggtg 
ggggcacaag 
gaagaacggc 
gctcgccgac 
caaccactac 
catggtcctg 
caagtaagcg 
ctggattaca 
ctatgtggat 
attttctcct 
gtcaggcaac 
attgccacca 
gcggaactca 
gacaattccg 
gccacctgga 
gaccttcctt 
cctcagacga 
cctcgagggt 



gtgagcaagg 
gacgtgaacg 
aagctgaccc 
gtgaccacct 
cacgacttct 
aaggacgacg 
aaccgcatcg 
ctggagtaca 
atcaaggtga 
cactaccagc 
ctgagcaccc 
ctggagttcg 
gccgctctag 
aaatttgtga 
acgctgcttt 
ccttgtataa 
gtggcgtggt 
cctgtcagct 
tcgccgcctg 
tggtgttgtc 
ttctgcgcgg 
cccgcggcct 
gtcggatctc 
acc 



gcgaggagct 
gccacaagtt 
tgaagttcat 
tcacctacgg 
tcaagtccgc 
gcaactacaa 
agctgaaggg 
actacaacag 
acttcaagat 
agaacacccc 
agtccgccct 
tgaccgccgc 
agaattcgat 
aagattgact 
aatgcctttg 
atcctggttg 
gtgcactgtg 
cctttccggg 
ccttgcccgc 
ggggaagctg 
gacgtccttc 
gctgccggct 
cctttgggcc 



gttcaccggg 
cagcgtgtcc 
ctgcaccacc 
cgtgcagtgc 
catgcccgaa 
gacccgcgcc 
catcgacttc 
ccacaacgtc 
ccgccacaac 
catcggcgac 
gagcaaagac 
cgggatcact 
atcaagctta 
ggtattctta 
tatcatgcta 
ctgtctcttt 
tttgctgacg 
actttcgctt 
tgctggacag 
acgtcctttc 
tgctacgtcc 
ctgcggcctc 
gcctccccgc 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2853 
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